Query Match 82.1%; Score 23; DB 11; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

lllllllllllllllllllllll 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 14 
US-10-372-095-23 

Sequence 23, Application US/10372095 
Publication No. US20030162256A1 
GENERAL INFORMATION: 
APPLICANT: Juppner, Harald 
APPLICANT: Rubin, David A. 

TITLE OF INVENTION: PTHIR and PTH3R Receptors, Methods and Uses Thereof 
^...PTLE -REFERENCE : 0609.4740002 
CURRENT APPLICATION NUMBER: US/10/372,095 
CURRENT FILING DATE: 2003-02-25 
PRIOR APPLICATION NUMBER: 09/449,632 
PRIOR FILING DATE: 1999-11-30 
PRIOR APPLICATION NUMBER: US 60/110,467 
PRIOR FILING DATE: 1998-11-30 
NUMBER OF SEQ ID NOS : 25 
SOFTWARE: Patentin Ver. 2.1 
SEQ ID NO 23 
LENGTH: 36 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-372-095-23 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

lllllllllllllllllllllll 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 15 
US-10-340-484-2 

; Sequence 2, Application US/10340484 

; Publication No. US20030171288A1 

; GENERAL INFORMATION: 

; APPLICANT: Stewart, Andrew F. 

; TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 

; TITLE OF INVENTION: Drugs 

; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/lO/340 , 484 

; CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 

; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 



PRIOR APPLICATION NUMBER: 60/368,955 
PRIOR FILING DATE: 2002-03-28 
PRIOR APPLICATION NUMBER: 60/379,125 
PRIOR FILING DATE: 2002-05-08 
NUMBER OF SEQ ID NOS : 27 
SOFTWARE: Patentin Ver. 2.1 
SEQ ID NO 2 
LENGTH: 36 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-340-484-2 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

Illllllllllllllllllllll 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 16 
US-10-340-484-3 

; Sequence 3, Application US/10340484 
; Publication No. US20030171288A1 
; GENERAL INFORMATION: 
; APPLICANT: Stewart, Andrew F. 

; TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 
; TITLE OF INVENTION: Drugs 
; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/10/340,484 
; CURRENT FILING DATE: 2003-01-10 
; PRIOR APPLICATION NUMBER: 60/347,215 
; PRIOR FILING DATE: 2002-01-10 
; PRIOR APPLICATION NUMBER: 60/353,296 
; PRIOR FILING DATE: 2002-02-01 
; PRIOR APPLICATION NUMBER: 60/368,955 
; PRIOR FILING DATE: 2002-03-28 
; PRIOR APPLICATION NUMBER: 60/379,125 
; PRIOR FILING DATE: 2002-05-08 
; NUMBER OF SEQ ID NOS: 27 
; SOFTWARE: Patentin Ver. 2.1 
; SEQ ID NO 3 
; • LENGTH: 36 
TYPE: PRT 

ORGANISM: Equus caballus 
US-10-340-484-3 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

Illllllllllllllllllllll 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 17 
US-10-340-484-4 

; Sequence 4, Application US/10340484 
; Publication No. US20030171288A1 
; GENERAL INFORMATION: 

APPLICANT: Stewart, Andrew F. 

TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 
; TITLE OF INVENTION: Drugs 
; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/lO/340 , 484 

; CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 

; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 

; PRIOR APPLICATION NUMBER: 60/368,955 

; PRIOR FILING DATE: 2002-03-28 

; PRIOR APPLICATION NUMBER: 60/379,125 

; PRIOR FILING DATE: 2002-05-08 

; NUMBER OF SEQ ID NOS : 27 

; SOFTWARE: Patentin Ver. 2.1 

; SEQ ID NO 4 

LENGTH: 36 

TYPE: PRT 

ORGANISM: Bos taurus 
US-10-340-484-4 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

lllllllllllllllllllllll 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 18 
US-10-340-484-5 

; Sequence 5, Application US/10340484 

; Publication No. US20030171288A1 

; GENERAL INFORMATION: 

; APPLICANT: Stewart, Andrew F. 

; TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 

; TITLE OF INVENTION: Drugs 

; FILE REFERENCE: 252 00-501 

; CURRENT APPLICATION NUMBER: US/lO/34 0 , 4 84 

CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 

; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 

; PRIOR APPLICATION NUMBER: 60/368,955 

; PRIOR FILING DATE: 2002-03-28 

; PRIOR APPLICATION NUMBER: 60/379,125 

; PRIOR FILING DATE: 2002-05-08 

; NUMBER OF SEQ ID NOS: 27 

; SOFTWARE: Patentin Ver. 2.1 



SEQ ID NO 5 
LENGTH: 36 
TYPE: PRT 

ORGANISM: Ovis aries 
US-10-340-484-5 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred, No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

lllllllllllllllllllllll 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 19 
US-10-340-484-6 

; Sequence 6, Application US/10340484 
; Publication No. US20030171288A1 
; GENERAL INFORMATION: 
; APPLICANT: Stewart, Andrew F. 

TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 
; TITLE OF INVENTION: Drugs 
; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/lO/34 0 , 484 

; CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 

; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 

; PRIOR APPLICATION NUMBER: 60/368,955 

; PRIOR FILING DATE: 2002-03-28 

; PRIOR APPLICATION NUMBER: 60/379,125 

; PRIOR FILING DATE: 2002-05-08 

; NUMBER OF SEQ ID NOS : 27 

; SOFTWARE: Patent In Ver. 2.1 

; SEQ ID NO 6 

LENGTH: 3 6 

TYPE: PRT 

ORGANISM: Canis familiaris 
US-10-340-484-6 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIIIIIMIIIIIIIIIIIM 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 20 
US-10-340-484-7 

; Sequence 7, Application US/10340484 
; Publication No. US20030171288A1 
; GENERAL INFORMATION: 
; APPLICANT: Stewart, Andrew F. 



TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 
; TITLE OF INVENTION: Drugs 
; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/ 10/34 0 , 484 

; CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 

; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 

; PRIOR APPLICATION NUMBER: 60/368,955 

; PRIOR FILING DATE: 2002-03-28 

; PRIOR APPLICATION NUMBER: 60/379,125 

; PRIOR FILING DATE: 2002-05-08 

; NUMBER OF SEQ ID NOS : 27 

SOFTWARE: Patentin Ver, 2.1 
; SEQ ID NO 7 

LENGTH: 36 

TYPE : PRT 

ORGANISM: Felis catus 
US-10-340-484-7 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIMIIIIIIIIIIIIIIIIIII 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 21 
US-10-340-484-8 

; Sequence 8, Application US/10340484 
; Publication No. US20030171288A1 
; GENERAL INFORMATION: 
; APPLICANT: Stewart, Andrew F. 

; TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 
; TITLE OF INVENTION: Drugs 
; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/lO/34 0 , 484 

; CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 

; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 

; PRIOR APPLICATION NUMBER: 60/368,955 

; PRIOR FILING DATE: 2002-03-28 

; PRIOR APPLICATION NUMBER: 60/379,125 

; PRIOR FILING DATE: 2002-05-08 

; NUMBER OF SEQ ID NOS: 27 

SOFTWARE: Patentin Ver. 2.1 
; SEQ ID NO 8 

LENGTH: 36 

TYPE : PRT 

ORGANISM: Orycctolagus cuniculus 
US-10-340-484-8 



Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

lllllllllllllllllllllll 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 22 
US-10-340-484-9 

; Sequence 9, Application US/10340484 
; Publication No. US20030171288A1 
; GENERAL INFORMATION: 

APPLICANT: Stewart, Andrew F. 

TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 
; TITLE OF INVENTION: Drugs 
; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/lO/34 0 , 484 

; CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 

; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 

; PRIOR APPLICATION NUMBER: 60/368,955 

; PRIOR FILING DATE: 2002-03-28 

; PRIOR APPLICATION NUMBER: 60/379,125 

; PRIOR FILING DATE: 2002-05-08 

; NUMBER OF SEQ ID NOS : 27 

SOFTWARE: PatentlnVer. 2.1 
; SEQ ID NO 9 

LENGTH: 36 

TYPE: PRT 

ORGANISM: Rattus norvegicus 
US-10-340-484-9 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 2 8 

lllllllllllllllllllllll 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 23 
US-10-340-484-10 

; Sequence 10, Application US/10340484 

; Publication No. US20030171288A1 

; GENERAL INFORMATION: 

; APPLICANT: Stewart, Andrew F, 

TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 
; TITLE OF INVENTION: Drugs 
; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/lO/34 0 , 484 

; CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 



; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 

/ PRIOR APPLICATION NUMBER: 60/368,955 

; PRIOR FILING DATE: 2002-03-28 

; PRIOR APPLICATION NUMBER: 60/379,125 

; PRIOR FILING DATE: 2002-05-08 

; NUMBER OF SEQ ID NOS: 27 

; SOFTWARE: Patent In Ver. 2.1 

; SEQ ID NO 10 

LENGTH: 36 

TYPE : PRT 

ORGANISM: Mus musculus 
US-10-340-484-10 

Query Match 82.1%; Score 23; DB 12; Length 36; 

Best Local Similarity 100.0%; Pred. No. 7.3e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIMIIIIIIIIIIMIIMI 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 24 
US-10-014-162-112 

Sequence 112, Application US/10014162 
Publication No. US2003 0032096A1 
GENERAL INFORMATION: 
APPLICANT: Usdin, Ted B. 
APPLICANT: Hoare, Samuel R.J. 

TITLE OF INVENTION: PARATHYROID HORMONE RECEPTOR LIGANDS 
FILE REFERENCE: NIH175.001C1 
CURRENT APPLICATION NUMBER: US/lO/014 , 162 
CURRENT FILING DATE: 2001-12-11 
PRIOR APPLICATION NUMBER: PCT/USOO/1677 
PRIOR FILING DATE: 2000-06-15 
PRIOR APPLICATION NUMBER: 60/139335 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS: 112 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 112 
LENGTH: 39 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-014-162-112 

Query Match 82.1%; Score 23; DB 15; Length 39; 

Best Local Similarity 100.0%; Pred. No. 7.8e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIIIIIIIMIIIIIIIIIII 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 25 



US-09-169-786-12 

; Sequence 12, Application US/09169786B 

; Patent No, US20020025929A1 

; GENERAL INFORMATION: 

; APPLICANT: Sato, Masahiko 

; TITLE OF INVENTION: METHOD OF BUILDING AND MAINTAINING BONE 
; FILE REFERENCE: X- 114 80 

; CURRENT APPLICATION NUMBER: US/O 9/169 , 786B 

; CURRENT FILING DATE: 1998-10-09 

; EARLIER APPLICATION NUMBER: US 60/061,800 

; EARLIER FILING DATE: 1997-10-14 

; NUMBER OF SEQ ID NOS : 12 

; SOFTWARE: Patent In Ver. 2.0 

; SEQ ID NO 12 

LENGTH: 4 0 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-169-786-12 

Query Match 82,1%; Score 23; DB 9; Length 40; 

Best Local Similarity 100.0%; Pred. No. 8e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIIIMIIIIMIIIIMIII 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 26 
US-09-843-221A-71 

; Sequence 71, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 22 lA 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS: 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 71 

LENGTH: 28 

TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
NAME/KEY: misc feature 



LOCATION: (6) - - (6) 
OTHER INFORMATION: D amino acid 
US-09-843-221A-71 



Query Match 78.6%; Score 22; DB 11; Length 28; 

Best Local Similarity 100.0%; Pred. No. 6.4e-16; 

Matches 22; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 7 KSIQDLRRRFFLHHLIAEIHTA 28 

llllllllllllllllllllll 
Db 7 KSIQDLRRRFFLHHLIAEIHTA 28 



RESULT 27 
US-09-843-221A-72 

; Sequence 72, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patentin version 3,1 
; SEQ ID NO 72 

LENGTH: 28 

TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified PTHrP 
NAME/KEY: misc_feature 
LOCATION: (6) . . (6) 
OTHER INFORMATION: D amino acid 
US-09-843-221A-72 

Query Match 78.6%; Score 22; DB 11; Length 28; 

Best Local Similarity 100.0%; Pred. No. 6.4e-16; 

Matches 22; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 7 KSIQDLRRRFFLHHLIAEIHTA 28 

IIIIMIIIIIIIMIMIIII 

Db 7 KSIQDLRRRFFLHHLIAEIHTA 28 



RESULT 28 
US-09-843-221A-75 



; Sequence 75, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INF0R^4ATI0N : 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 75 

LENGTH: 28 

TYPE : PRT 
; ORGANISM: Artificial Sequence 

FEATURE : 

OTHER INFORMATION: modified human PTHrP 
NAME/KEY: misc_feature 
LOCATION: (6) - . (6) 
OTHER INFORMATION: D amino acid 
US-09-843-221A-75 

Query Match 78.6%; Score 22; DB 11; Length 28; 

Best Local Similarity 100.0%; Pred. No. 6.4e-16; 

Matches 22; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 7 KSIQDLRRRFFLHHLIAEIHTA 28 

IIMMIIIIIIIIIIIIIIII 
Db 7 KSIQDLRRRFFLHHLIAEIHTA 28 



RESULT 2 9 
US-09-843-221A-67 

; Sequence 67, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE- 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843,221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 



; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 67 

LENGTH: 33 

TYPE: PRT 
; ORGANISM: Artificial Sequence 

FEATURE : 

OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-67 

Query Match 78.6%; Score 22; DB 11; Length 33; 

Best Local Similarity 100.0%; Pred. No. 7.4e-16; 

Matches 22; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 7 KSIQDLRRRFFLHHLIAEIHTA 28 

Illlllllllllllllllllll 
Db 12 KSIQDLRRRFFLHHLIAEIHTA 33 



RESULT 30 
US-09-843-221A-76 

; Sequence 76, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 22 lA 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS: 170 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 76 

LENGTH: 30 

TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-843-221A-76 

Query Match 67.9%; Score 19; DB 11; Length 30; 

Best Local Similarity 100.0%; Pred. No. 8.7e-13; 

Matches 19; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 6 GKSIQDLRRRFFLHHLIAE 24 

IIIIIIIIIIMIIIIIII 
Db 12 GKSIQDLRRRFFLHHLIAE 3 0 



RESULT 31 
US-09-843-221A-78 

; Sequence 78, Application US/09843221A 

; Publication No. US2003003 9654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843,221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

PRIOR FILING DATE: 2000-06-28 
; PRIOR APPLICATION NUMBER: 60/200,053 
; PRIOR FILING DATE: 2000-04-27 
; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 78 
LENGTH: 31 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: human PTHrP with non-human N-terminal peptide 
US-09-843-221A-78 

Query Match 67.9%; Score 19; DB 11; Length 31; 

Best Local Similarity 100.0%; Pred. No. 8.9e-13; 

Matches 19; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKS I QDLRRRFFLHHLI AE 24 

IIIIIIIIIIIIIIMIII 
Db 13 GKS I QDLRRRFFLHHLI AE 31 



RESULT 32 
US-09-843-221A-79 

; Sequence 79, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843,221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/2 66,673 

; PRIOR FILING DATE: 2001-02-06 



; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS : 170 

; SOFTWARE: Patentin version 3.1 

; SEQ ID NO 79 

LENGTH: 30 

TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
; NAME/KEY: misc_f eature 

LOCATION: (12) . . (12) 

OTHER INFORMATION: D amino acid 
US-09-843-221A-79 

Query Match 64.3%; Score 18; DB 11; Length 30; 

Best Local Similarity 100.0%; Pred. No. 9.4e-12; 

Matches 18; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 7 KSIQDLRRRFFLHHLIAE 24 

IIIIIMIIIIIIIIIM 

Db 13 KSIQDLRRRFFLHHLIAE 3 0 



RESULT 33 
US-09-843-221A-70 

; Sequence 70, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE- 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/ 09/843 , 22 lA 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS: 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 70 

LENGTH: 2 8 

TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified PTHrP 
US-09-843-221A-70 



Query Match 



60.7%; Score 17; DB 11; Length 28; 



Best Local Similarity 100.0%; Pred. No. 9,6e-ll; 

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 12 LRRRFFLHHLIAEIHTA 28 

Mlllllllllllllll 
Db 12 LRRRFFLHHLIAEIHTA 28 



RESULT 34 
US-10-097-079-77 

; Sequence 77, Application US/10097079 
; Publication No, US20020132973A1 
GENERAL INFORMATION: 

APPLICANT: Condon, Stephen M. 
; Morize, Isabelle 

TITLE OF INVENTION: PEPTIDE PARATHYROID HORMONE ANALOGS 
NUMBER OF SEQUENCES: 88 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Rhone-Poulenc Rorer Inc. 

STREET: 500 Areola Road, Mailstop 3C43 

CITY: Collegeville 

STATE : PA 

COUNTRY: USA 

ZIP: 19426 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/lO/097 , 079 

FILING DATE: 13-Mar-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/228,990 

FILING DATE: <Unknown> 

APPLICATION NUMBER: US 60/046,472 

FILING DATE: 14 -MAY- 19 97 
ATTORNEY/AGENT INFORMATION: 

NAME: Martin Esq., Michael B. 

REGISTRATION NUMBER: 37,521 

REFERENCE/DOCKET NUMBER: A2 678B-WO 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (610) 454-2793 

TELEFAX: (610) 454-3808 
INFORMATION FOR SEQ ID NO: 77: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 28 amino acids 

TYPE: amino acid 

STRANDEDNESS : <Unknown> 

TOPOLOGY: No. US20020132973A1 Relevant 
MOLECULE TYPE: peptide 
FRAGMENT TYPE: N-terminal 
FEATURE : 

NAME /KEY: Peptide 

LOCATION: 12.. 16 

OTHER INFORMATION: /product- "OTHER" 



; /note= "The side chains of Lys at position 12 and Asp at 

position 16 are linked by an amide bond." 
FEATURE : 

NAME/KEY: Peptide 
LOCATION: 28 

OTHER INFORMATION: /product- "OTHER" 
; /note= "This C-terminal amino acid is an amide, i.e., C0NH2 . 

SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
US-10-097-079-77 



Query Match 42.9%; Score 12; DB 14; Length 28; 

Best Local Similarity 100.0%; Pred. No. 1.4e-05; 

Matches 12; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 17 FLHHLIAEIHTA 28 

Illlllllllll 
Db 17 FLHHLIAEIHTA 28 



RESULT 35 
US-10-097-079-71 

; Sequence 71, Application US/10097079 
; Publication No. US20020132973A1 
GENERAL INFORMATION: 

APPLICANT: Condon, Stephen M. 

Morize, Isabelle 
TITLE OF INVENTION: PEPTIDE PARATHYROID HORMONE ANALOGS 
NUMBER OF SEQUENCES: 88 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Rhone-Poulenc Rorer Inc. 

STREET: 500 Areola Road, Mail stop 3C43 

CITY: Collegeville 

STATE: PA 

COUNTRY: USA 

ZIP: 19426 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 10/097 , 079 

FILING DATE: 13-Mar-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/228,990 

FILING DATE: <Unknown> 

APPLICATION NUMBER: US 60/046,472 

FILING DATE: 14-MAY-1997 
ATTORNEY/AGENT INFORMATION: 

NAME: Martin Esq., Michael B. 

REGISTRATION NUMBER: 37,521 

REFERENCE/DOCKET NUMBER: A2678B-WO 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (610) 454-2793 

TELEFAX: (610) 454-3808 
INFORMATION FOR SEQ ID NO: 71: 



SEQUENCE CHARACTERISTICS: 

LENGTH: 34 amino acids 
TYPE: amino acid 
STRANDEDNESS : <Unknown> 

TOPOLOGY: No. US20020132973A1 Relevant 
MOLECULE TYPE: peptide 
FRAGMENT TYPE: N-terminal 
FEATURE : 

NAME/KEY: Peptide 

LOCATION: 18., 22 

OTHER INFORMATION: /produCt= "OTHER" 
; /note= "The side chains of Lys at position 18 and Asp at 

; position 22 are linked by an amide bond. " 

FEATURE : 

NAME/KEY : Pept ide 

LOCATION: 34 

OTHER INFORMATION: /product = "OTHER" 
; /note= "This C-terminal amino acid is an amide, i.e., C0NH2 . " 

SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
US-10-097-079-71 



Query Match 42.9%; Score 12; DB 14; Length 34; 

Best Local Similarity 100.0%; Pred. No. 1.7e-05; 

Matches 12; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 17 FLHHLIAEIHTA 28 

Illlllllllll 
Db 23 FLHHLIAEIHTA 34 



RESULT 36 
US-09-843-221A-77 

; Sequence 77, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 22 lA 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2 001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 77 

LENGTH: 3 0 

TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 



OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-77 

Query Match 39.3%; Score 11; DB 11; Length 30; 

Best Local Similarity 100.0%; Pred. No. 0.00017; 

Matches 11; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKSIQDLRRRF 16 

Illllllllll 
Db 12 GKSIQDLRRRF 22 



RESULT 37 

US-09-843-221A-150 

; Sequence 150, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE- 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843,221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: PatentIn version 3.1 
; SEQ ID NO 150 

LENGTH: 30 

TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-150 



Query Match 3 9.3%; Score 11; DB 11; Length 30; 

Best Local Similarity 100.0%; Pred. No. 0.00017; 

Matches 11; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 5 LGKSIQDLRRR 15 

Illllllllll 
Db 11 LGKSIQDLRRR 21 



RESULT 38 

US-09-843-221A-114 

; Sequence 114, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 



; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 22 lA 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 114 
LENGTH: 34 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-114 

Query Match 39.3%; Score 11; DB 11; Length 34; 

Best Local Similarity 100,0%; Pred. No. 0.00018; 

Matches 11; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 5 LGKSIQDLRRR 15 

lllllllllll 
Db 11 LGKSIQDLRRR 21 



RESULT 3 9 
US-09-843-221A-65 

; Sequence 65, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS: 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 65 
LENGTH: 36 



TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-65 

Query Match 39.3%; Score 11; DB 11; Length 36; 

Best Local Similarity 100.0%; Pred. No. 0.00019; 

Matches 11; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 6 GKSIQDLRRRF 16 

IIMIIIIIII 
Db 12 GKSIQDLRRRF 22 



RESULT 4 0 

US-09-843-221A-103 

; Sequence 103, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 221A 

CURRENT FILING DATE: 2001-04-26 
; PRIOR APPLICATION NUMBER: 60/266,673 
; PRIOR FILING DATE: 2001-02-06 
; PRIOR APPLICATION NUMBER: 60/214,860 
; PRIOR FILING DATE: 2000-06-28 
; PRIOR APPLICATION NUMBER: 60/200,053 
; PRIOR FILING DATE: 2000-04-27 
; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 103 
LENGTH: 28 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-103 

Query Match 35.7%; Score 10; DB 11; Length 28; 

Best Local Similarity 100.0%; Pred. No. 0.0017; 

Matches 10; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 6 GKSIQDLRRR 15 

IIIIIIMM 

Db 6 GKSIQDLRRR 15 



Search completed: January 14, 2004, 11:15:11 
Job time : 19.8411 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 14, 2004, 10:28:19 ; Search time 20.9346 Seconds 

(without alignments) 
345.145 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 
Word size : 



US-09-843-221A-169 
28 

1 LLHNLGKSIQDLRRRFFLHHLIAEIHTA 28 
OLIGO 

Gapop 60.0 , Gapext 60.0 
830525 seqs, 258052604 residues 
0 



Total number of hits satisfying chosen parameters: 13497 

Minimum DB seq length: 28 
Maximum DB seq length: 4 0 



Post -processing: Listing first 1000 summaries 



Database : SPTREMBL_23 : * 

1 : sp__archea: * 

2: sp_bacteria : * 

3 : sp_f ungi : * 

4 : sp_human : * 

5 : sp_invertebrate : * 

6 : sp_mammal : * 

7 : sp_mhc : * 

8 : sp_organelle: * 

9 : sp_phage : * 

10: sp_plant : * 

11: sp_rodent:* 

12 : sp_virus : * 

13 : sp_vertebrate : * 

14 : sp_unclassif ied: * 

15: sp_rvirus:* 

16: sp_bacteriap: * 

17: sp_archeap:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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Q8JA82 
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Q8JA88 
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Q8J9R3 
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Q8JAC4 
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Q8J9Q7 


081 9q7 


human 


immun 


104 


4 


14 


. 3 


35 


15 


Q8J9P3 


Q8j 9p3 
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Q8JA93 


Q8ja93 
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Q8JAA2 
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Q8J9P2 


Q8j9p2 
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Q8JAB7 


Q8jab7 
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Q8JAA8 
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human 
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Q8J9N8 


Q8j9n8 


human 


immun 


111 


4 


14, 


.3 


35 


15 


Q8JA98 
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Q75990 
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human 


immun 


113 


4 


14 , 


.3 


35 


15 


Q75989 
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Q70833 
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human 
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Q8J9R2 


Q8j9r2 human immun 


116 
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Q8JAC0 


Q8jac0 human immun 


117 
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Q76253 


Q76253 human immun 


118 
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Q70798 


Q70798 human immun 


119 
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15 


Q8JAA4 


Q8jaa4 human immun 


120 
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35 


15 


Q8J9P6 


Q8j9p6 human immun 
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Q8JA46 


Q8ja46 human immun 


122 


4 
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Q70300 


Q70300 human immun 
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15 


Q8J9Q3 


Q8j9q3 human immun 


124 
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15 


Q8J9C7 


Q8j9c7 human immun 


125 
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15 


Q8J9Q1 


Q8j9ql human immun 


126 
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Q8J9N7 


Q8j9n7 human immun 


127 
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Q8JAC1 


QSjacl human immun 


128 
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Q8JAB4 


Q8jab4 human immun 
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Q8J9R4 


Q8j9r4 human immun 


130 
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Q8J9P7 


Q8j9p7 human immun 


131 
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Q8JAB3 


Q8jab3 human immun 
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Q77250 


Q77250 human immun 


133 
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15 


Q75955 


Q75 955 human immun 


134 
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15 


Q8J9Q4 


Q8j9q4 human immun 


135 
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15 


Q70776 


Q70776 human immun 
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Q8J9P1 


Q 8 j 9p 1 human i mmun 
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Q8j n immun 


138 
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Q70827 


Q70827 human immun 
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Q8JA99 


Q8ja99 human immun 


140 
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Q9PBV9 


Q9pbv9 xylella fas 
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Q8EXR3 


Q8exr3 leptospira 
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Q8EEP3 


Q8eep3 shewanella 
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Q15530 


Q1553 0 homo sapien 


144 
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Q79434 


Q79434 human immun 
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Q79419 


Q79419 human immun 
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Q8EYJ2 


Q8eyj2 leptospira 
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Q8MDC2 


Q8mdc2 larix gmeli 
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QSayzO goose circo 
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Q8CLL0 


Q8cll0 yersinia pe 
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Q9XCW6 


Q9xcw6 rhodobacter 
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P97145 


P97145 escherichia 
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Q9BWQ4 


Q9bwq4 homo sapien 
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Q8X3X9 


Q8x3x9 escherichia 
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Q9FEY1 


Q9feyl heterocapsa 
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Q9QWJ4 


Q9qwj4 mus musculu 
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Q04259 


Q04259 influenza a 
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Q90SK8 


Q90sk8 human immun 
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Q90SK7 


Q90sk7 human immun 
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Q8EZ33 


Q8ez33 leptospira 
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P78340 


P78340 homo sapien 
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Q9H3B8 


Q9h3b8 homo sapien 
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Q8V647 


Q8v647 rabies viru 


166 
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Q9WGB3 


Q9wgb3 human herpe 
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Q9AH13 
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Q47736 


Q47736 enterococcu 
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Q9ZG30 


Q9zg3 0 chlamydia t 
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Q49327 
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Q9R5E6 


Q9r5e6 thermus aqu 
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Q9REI4 


Q9rei4 acidiphiliu 
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Q9R5G3 


Q9r5g3 bacteroides 
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Q9XB41 


Q9xb41 escherichia 
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Q9R4Z2 


Q9r4z2 lactobacill 
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3 


10 . 


7 
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Q9ZIZ8 


Q9ziz8 borrelia bu 
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Q9UT99 


Q9ut99 schizosacch 
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Q9URF2 


Q9urf2 trametes ve 


179 
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Q9UEN1 


Q9uenl homo sapien 
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043804 


0438 04 homo sapien 
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.7 
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4 


Q96RV7 


Q96rv7 homo sapien 
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Q16325 


Q16325 homo sapien 
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1043Y 75X 737 


Lengt5095737 homo sapien 


184 
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Q96G80 


Q96g8 0 homo sapien 
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Q8IXD3 
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Q9BJE1 
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Q9BJE0 


Q9bje0 pauropus sp 


188 


3 


10. 


,7 


28 


5 


Q9BJE3 


Q9bje3 pauropus sp 
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Q9GPK5 


Q9gpk5 scutigerell 
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Q95P93 


Q95p93 mesobuthus 
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Q9BM76 


Q9bm76 giardia lam 
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Q9BM74 


Q9bm74 giardia lam 
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Q9GPK6 


Q9gpk6 scutigerell 
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Q9BJE2 


Q9bje2 pauropus sp 
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Q9bm68 glottidia p 
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Q9bm75 giardia lam 
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Q9trm4 bos taurus 
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Q8MJG7 


Q8mjg7 sus scrofa 


200 


3 


10, 


.7 


28 


6 


Q9N264 


Q9n264 bos taurus 


201 


3 


10, 


.7 


28 


6 


Q9TSY9 
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Q9T4F8 
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Q8s6h4 oryza sativ 
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Q9QWC0 


Q9qwc0 rattus sp. 
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Q9QWQ3 


Q9qwq3 rattus norv 
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P70542 


P70542 rattus norv 
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P97914 rattus norv 
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055021 mus musculu 
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Q63743 rattus norv 
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Q8CFB9 


Q8cfb9 mus musculu 
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Q8C1H5 


Q8clh5 mus musculu 
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Q9wni9 tt virus, o 


224 


3 
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Q68552 


Q68552 hepatitis c 
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Q9DD70 


Q9dd70 gallus gall 
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Q8QGY7 


Q8qgy7 fugu rubrip 
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Q90757 gallus gall 
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Q9DFT7 


Q9dft7 gallus gall 
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Q73626 


Q7362 6 human immun 
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071995 human endog 
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Q70310 human immun 
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Q8ENT7 


Q8ent7 oceanobacil 
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Q8E014 


Q8e014 streptococc 
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Q8ZYY3 
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Q93IG2 
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Q49148 
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Q9R526 
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Q9R5G7 
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Q9K340 
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Q9ACD9 


Q9acd9 vibrio salm 
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Q51692 
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Q9HDQ3 


Q9hdq3 Candida rug 


245 
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29 


4 


Q9Y3G1 


Q9y3gl homo sapien 


246 
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29 


4 


Q9H2A1 


Q9h2al homo sapien 


247 
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4 


Q9UEF0 


Q9uefO homo sapien 


248 
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4 


095485 


095485 homo sapien 


249 
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4 


Q8TAI5 


Q8tai5 homo sapien 
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Q967U3 


Q967u3 schistocerc 


251 
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Q24684 


Q24684 dugesia tig 


252 


3 


10 , 


. 7 


29 


5 


Q8T3E8 


Q8t3e8 caenorhabdi 
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Q24688 


Q24688 dugesia tig 


254 


3 


10 , 


. 7 


29 


5 


Q95SE3 


Q95se3 drosophila 
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Q967U4 


Q967u4 schistocerc 
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Q24683 


Q24683 dugesia tig 
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Q8T936 


Q8t936 folsomia ca 
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Q246B7 


Q24687 dugesia tig 
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Q8ITD6 


Q8itd6 schistosoma 
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Q9TRL6 


Q9trl6 bos taurus 
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Q9TSY6 


Q9tsy6 pan troglod 


262 
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Q8SQD5 


Q8sqd5 macaca mula 
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062784 


062784 isoodon mac 
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Q8HWM0 


QShwmO homo sapien 
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Q9G651 


Q9g651 otocryptis 
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Q9ZX23 


Q9zx23 mycobacteri 


267 
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P82193 


P82193 spinacia ol 
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Q9QWM6 


Q9qwm6 mus musculu 
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088227 


088227 mus musculu 
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11 


088221 


088221 mus musculu 


271 


3 
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11 


Q9Z2C0 


Q9z2c0 mus musculu 


272 


3 


10 
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11 


Q91VE8 


Q91ve8 mus musculu 


273 


3 


10 


. 7 
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11 


Q8K362 


Q8k3 62 mus musculu 
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3 
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11 


Q64353 


Q64353 mus musculu 


275 
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088225 


088225 mus musculu 


276 
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11 


088214 


088214 mus musculu 
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Q9Z2C1 
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3 
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088217 


088217 mus musculu 
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3 
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070564 


070564 mus musculu 
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3 
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11 


Q91UZ3 


Q91uz3 mus musculu 
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3 
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088218 


088218 mus musculu 


282 


3 
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Q9QWN1 


Q9qwnl mus musculu 
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088223 


088223 mus musculu 
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Q99JY5 


Q99jy5 mus musculu 
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088220 


088220 mus musculu 
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088215 


08 8215 mus musculu 


287 
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Q8CGM8 


Q8cgm8 mus musculu 
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Q86235 


Q86235 human rotav 
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Q86232 


Q8 6232 human rotav 
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Q86233 


Q8 6233 human rotav 
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056835 vibrio chol 


292 
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Q86234 
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3 
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13 


Q9PRF9 


Q9prf9 brachydanio 
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3 
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.7 
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13 


Q91396 


Q91396 gallus gall 
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3 
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13 


013043 
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514 


3 


10 , 


7 


32 


4 


Q9BY35 


Q9by3 5 homo sapien 




3 


10 . 


7 


32 


4 


Q9UEB0 


Q9ueb0 homo sapien 


J J. u 


3 


10 . 


7 


32 


4 


Q8TB02 


Q8tb02 homo sapien 


R 1 7 
OX/ 


3 


10 . 


7 


32 


4 


Q9HAX8 


Q9hax8 homo sapien 


sift 

J J- o 


3 


10 , 


7 


32 


4 


Q8TBQ3 


Q8tbq3 homo sapien 




3 


10 . 


7 


32 


4 


Q96D66 


Q96d66 homo sapien 


3 ^ U 


3 


10 . 


7 


32 


4 


Q8NG01 


Q8ng01 homo sapien 


521 


3 


10 . 


7 


32 


4 


Q9UNE9 


Q9une9 homo sapien 


522 


3 


10 . 


7 


32 


5 


Q9GPD9 


Q9gpd9 drosophila 


523 


3 


10 . 


7 


32 


5 


Q9GTB1 


Q9gtbl eimeria ten 


524 


3 


10 . 


7 


32 


5 


Q26651 


Q26651 strongyloce 


525 


3 


10 . 


7 


32 


5 


Q26828 


Q26828 trypanosoma 


526 


3 


10 . 


7 


32 


5 


Q8T382 


Q8t382 leishmania 


527 


3 


10 . 


7 


32 


5 


096634 


096634 trypanosoma 


S9fl 


3 


10 . 


7 


32 


5 


Q9GNH0 


Q9gnh0 drosophila 


529 


3 


10 . 


7 


32 


5 


Q9GT99 


Q9gt99 babes ia bov 


530 


3 


10 , 


7 


32 


5 


Q9GTC0 


Q9gtc0 neospora ca 


531 


3 


10 , 


7 


32 


6 


Q9TSE6 


Q9tse6 oryctolagus 


532 


3 


10 . 


7 


32 


6 


062854 


062854 oryctolagus 


533 


3 


10 , 


. 7 


32 


6 


Q9MYR8 


Q9myr8 oryctolagus 


534 


3 


10 . 


, 7 


32 


6 


Q9TSE5 


Q9tse5 oryctolagus 


535 


3 


10 . 


. 7 


32 


6 


077647 


077647 macropus eu 


536 


3 


10 . 


. 7 


32 


7 


Q95HL3 


Q95hl3 homo sapien 


537 


3 


10 . 


. 7 


32 


7 


Q8SNB9 


Q8snb9 homo sapien 


J o o 


3 


10 . 


, 7 


32 


7 


Q31597 


Q31597 xenopus lae 


J J 


3 


10 . 


. 7 


32 


7 


Q9BCT8 


Q9bct8 homo sapien 


54 0 


3 


10 , 


. 7 


32 


7 


Q8MHQ2 


Q8mhq2 homo sapien 


541 


3 


10 . 


. 7 


32 


7 


Q9BCU0 


Q9bcu0 homo sapien 


542 


3 


10 


. 7 


32 


8 


Q94VL5 


Q94vl5 salmo trutt 


543 


3 


10 


, 7 


32 


8 


Q94NY1 


Q94nyl salmo salar 


544 


3 


10 


. 7 


32 


8 


Q36463 


Q36463 onychomys 1 


545 


3 


10 


. 7 


32 


8 


Q9T2Q1 


Q9t2ql pinus sylve 


546 


3 


10 


. 7 


32 


8 


Q951P3 


Q951p3 salmo salar 


547 


3 


10 


. 7 


32 


9 


Q9MBU5 


Q9mbu5 chlamydia p 


548 


3 


10 


. 7 


32 


9 


Q9MBU9 


Q9mbu9 vibrio phag 


549 


3 


10 


. 7 


32 


10 


Q9S8X7 


Q9s8x7 daucus caro 


550 


3 


10 


. 7 


32 


10 


Q9FUQ1 


Q9fuql zea mays (s 


551 


3 


10 


, 7 


32 


10 


09FU04 


Q9fuq4 zea mays (s 


552 


3 


10 


. 7 


32 


10 


Q9FE80 


Q9fe80 zea diplope 


553 


3 


10 


. 7 


32 


10 


Q9FUP8 


Q9fup8 zea luxuria 


554 


3 


10 


. 7 


32 


11 


Q9R0E3 


Q9r0e3 mus musculu 


555 


3 


10 


. 7 


32 


11 


Q9QVG6 


Q9qvg6 rattus sp . 


556 


3 


10 


. 7 


32 


11 


P97642 


P97642 rattus norv 


557 


3 


10 


. 7 


32 


11 


Q61695 


Q61695 mus musculu 


558 


3 


10 


. 7 


32 


11 


Q9QWN2 


Q9qwn2 mus musculu 


559 


3 


10 


. 7 


32 


11 


P97649 


P97649 rattus norv 


560 


3 


10 


. 7 


32 


11 


Q8C2N8 


Q8c2n8 mus musculu 


J U J- 


3 


10 


^ 7 


32 


11 


Q8BS12 


Q8bsl2 mus musculu 


562 


3 


10 


, 7 


32 


12 


Q66855 


Q66855 foot-and-mo 


563 


3 


10 


.7 


32 


13 


Q8QG73 


Q8qg73 oncorhynchu 


564 


3 


10 


.7 


32 


13 


Q8QG72 


Q8qg72 salmo salar 


565 


3 


10 


.7 


32 


13 


Q8QG71 


Q8qg71 oncorhynchu 


566 


3 


10 


.7 


32 


13 


Q9PS21 


Q9ps21 carassius a 


567 


3 


10 


.7 


32 


13 


Q8UUN6 


Q8uun6 xenopus lae 


568 


3 


10 


.7 


32 


13 


Q8QG84 


Q8qg84 oncorhynchu 


569 


3 


10 


.7 


32 


13 


Q8QG83 


Q8qg83 oncorhynchu 


570 


3 


10 


.7 


32 


13 


Q8QG82 


Q8qg82 oncorhynchu 



571 


3 


10 . 


. 7 


32 


13 


Q8QG70 


Q8qg70 salvelinus 


572 


3 


10, 


, 7 


32 


13 


P82780 


P82780 rana catesb 


573 


3 


10. 


, 7 


32 


13 


Q9W7P3 


Q9w7p3 morone saxa 


574 


3 


10. 


. 7 


32 


13 


Q9W7P2 


Q9w7p2 morone saxa 


575 


3 


10 . 


. 7 


32 


16 


050851 


050851 borrelia bu 


576 


3 


10. 


,1 


32 


16 


Q9PCD4 


Q9pcd4 xylella fas 


577 


3 


10. 


,1 


32 


16 


Q9KVF7 


Q9kvf7 vibrio chol 


578 


3 


10. 


.7 


32 


16 


Q9KTM5 


Q9ktm5 vibrio chol 


579 


3 


10. 


. 7 


32 


16 


Q9KKU3 


Q9kku3 vibrio chol 


580 


3 


10, 


,1 


32 


16 


Q9K6N3 


Q9k6n3 bacillus ha 


581 


3 


10 . 


,1 


32 


16 


Q9K175 


Q9kl75 neisseria m 


582 


3 


10. 


.7 


32 


16 


Q97SW9 


Q97sw9 streptococc 


583 


3 


10. 


.7 


32 


16 


Q97RT9 


Q97rt9 streptococc 


584 


3 


10. 


.7 


32 


16 


Q97NT4 


Q97nt4 streptococc 


585 


3 


10. 


.7 


32 


16 


Q8VKF9 


Q8vkf9 mycobacteri 


586 


3 


10, 


,7 


32 


16 


Q9CDK7 


Q9cdk7 lactococcus 


587 


3 


10, 


,1 


32 


16 


Q8KCV0 


Q8kcv0 chlorobium 


588 


3 


10. 


,7 


32 


16 


Q8KCP1 


QSkcpl chlorobium 


589 


3 


10. 


.7 


32 


16 


Q8EFA5 


Q8efa5 shewanella 


590 


3 


10. 


.7 


32 


16 


Q8ECK5 


Q8eck5 shewanella 


591 


3 


10. 


.7 


32 


16 


Q8E9X5 


Q8e9x5 shewanella 


592 


3 


10. 


.7 


32 


16 


Q8E9D2 


Q8e9d2 shewanella 


593 


3 


10. 


. 7 


32 


16 


Q8CTI0 


QSctiO staphylococ 


594 


3 


10. 


.7 


32 


16 


Q8CTH2 


Q8cth2 staphylococ 


595 


3 


10. 


. 7 


32 


17 


Q8ZZF7 


Q8zzf7 pyrobaculum 


596 


3 


10, 


. 7 


33 


1 


Q9UWL4 


Q9uwl4 methanopyru 


597 


3 


10 , 


, 1 


33 


2 


Q9S3I9 


Q9s3i9 haemophilus 


598 


3 


10, 


, 7 


33 


2 


Q9R5N9 


Q9r5n9 Clostridium 


599 


3 


10, 


. 7 


33 


2 


Q9K370 


Q9k370 rhizobium 1 


600 


3 


10, 


. 7 


33 


2 


Q9R2F6 


Q9r2f6 klebsiella 


601 


3 


10, 


. 7 


33 


2 


Q9S3N5 


Q9s3n5 bacillus ce 


602 


3 


10, 


,7 


33 


2 


Q9ZG76 


Q9zg76 chlamydia t 


603 


3 


10. 


. 7 


33 


2 


Q9KI23 


Q9ki23 helicobacte 


604 


3 


10. 


,7 


33 


2 


Q56122 


Q56122 salmonella 


605 


3 


10. 


.7 


33 


2 


Q09122 


Q09122 fusobacteri 


606 


3 


10. 


, 7 


33 


2 


P82583 


P82583 streptococc 


607 


3 


10. 


, 7 


33 


2 


Q8GQU2 


Q8gqu2 leptospira 


608 


3 


10. 


, 7 


33 


2 


Q8GG98 


Q8gg98 neisseria p 


609 


3 


10. 


, 7 


33 


3 


Q8TGR1 


QStgrl saccharomyc 


610 


3 


10. 


, 7 


33 


3 


Q10986 


Q10986 absidia. ba 


611 


3 


10. 


, 7 


33 


4 


Q9NPZ1 


Q9npzl homo sap i en 


612 


3 


10. 


, 7 


33 


4 


Q9BUZ1 


Q9buzl homo sap i en 


613 


3 


10. 


. 7 


33 


4 


Q15285 


Q15285 homo sapien 


614 


3 


10- 


. 7 


33 


4 


Q9UMG4 


Q9umg4 homo sapien 


615 


3 


10. 


,7 


33 


4 


Q9BST3 


Q9bst3 homo sapien 


616 


3 


10. 


.7 


33 


4 


Q9BV16 


Q9bvl6 homo sapien 


617 


3 


10. 


.7 


33 


4 


Q8WWE0 


QSwweO homo sapien 


618 


3 


10. 


.7 


33 


4 


Q9BYF4 


Q9byf4 homo sapien 


619 


3 


10. 


. 7 


33 


4 


Q92668 


Q92668 homo sapien 


620 


3 


10. 


, 7 


33 


5 


Q25433 


Q25433 molgula cit 


621 


3 


10, 


7 


33 


5 


Q9GTB6 


Q9gtb6 eimeria ten 


622 


3 


10. 


7 


33 


5 


Q9GT96 


Q9gt96 crypt ospori 


623 


3 


10. 


7 


33 


5 


Q9GTA3 


Q9gta3 babes ia bov 


624 


3 


10. 


7 


33 


5 


Q9GTB0 


Q9gtb0 sarcocystis 


625 


3 


10 . 


7 


33 


5 


Q27637 


Q27637 drosophila 


626 


3 


10- 


7 


33 


5 


Q8WTH4 


Q8wth4 heteroptery 


627 


3 


10. 


7 


33 


5 


Q9NGN0 


Q 9 ngn 0 s t r ongy 1 o c e 



628 


3 


10. 


7 


33 


5 


Q9GTA7 


Q9gta7 sarcocystis 


629 


3 


10. 


7 


33 


5 


Q9GTB8 


Q9gt]D8 neospora ca 


630 


3 


10 . 


7 


33 


5 


Q9GTA5 


Q9gta5 babes ia bov 


631 


3 


10 . 


7 


33 


5 


Q95S45 


Q95s45 drosophila 


632 


3 


10 . 


7 


33 


5 


Q9GT98 


Q9gt98 cryptospori 


633 


3 


10. 


7 


33 


5 


017147 


017147 echinococcu 


634 


3 


10. 


7 


33 


5 


Q9VHD7 


Q9vhd7 drosophila 


635 


3 


10. 


7 


33 


6 


Q28788 


Q28788 papio hamad 


636 


3 


10. 


7 


33 


6 


077664 


077664 sus scrofa 


637 


3 


10. 


7 


33 


6 


Q9TSX7 


Q9tsx7 sus scrofa 


638 


3 


10. 


7 


33 


6 


Q29066 


Q29066 sus scrofa 


639 


3 


10 . 


7 


33 


6 


Q9TRD6 


Q9trd6 oryctolagus 


640 


3 


10. 


7 


33 


6 


Q8MIL2 


Q8mil2 bos taurus 


641 


3 


10. 


7 


33 


8 


Q9BAC6 


Q9bac6 euglena gra 


642 


3 


10. 


7 


33 


8 


Q9BAC1 


Q9bacl euglena ste 


643 


3 


10. 


7 


33 


8 


Q8WI31 


Q8wi31 psilotum nu 


644 


3 


10. 


7 


33 


8 


Q9BAC4 


Q9bac4 euglena mut 


645 


3 


10. 


7 


33 


8 


Q9MS75 


Q9ms75 euglena ana 


646 


3 


10 . 


7 


33 


8 


047024 


047024 barbeya ole 


647 


3 


10. 


7 


33 


9 


Q8SCC2 


Q8scc2 vibrio harv 


648 


3 


10. 


7 


33 


9 


Q38588 


Q38588 bacteriopha 


649 


3 


10. 


.7 


33 


9 


Q38551 


Q38 551 bacteriopha 


650 


3 


10. 


,7 


33 


10 


Q02904 


Q02904 arabidopsis 


651 


3 


10. 


,7 


33 


10 


Q94IJ2 


Q94ij2 zea mays (m 


652 


3 


10. 


.7 


33 


10 


Q9XJ70 


Q9xj7 0 cucumis sat 


653 


3 


10, 


,1 


33 


10 


Q9AYQ5 


Q9ayq5 cucumis sat 


654 


3 


10, 


.7 


33 


10 


Q9XJ69 


Q9xj69 cucumis sat 


655 


3 


10. 


. 7 


33 


10 


Q8H7H6 


Q8h7h6 arabidopsis 


656 


3 


10. 


.7 


33 


11 


088226 


088226 mus musculu 


657 


3 


10. 


.7 


33 


11 


Q9JLA8 


Q9jla8 mus musculu 


658 


3 


10. 


.7 


33 


11 


Q9Z298 


Q9z298 rattus norv 


659 


3 


10 


.7 


33 


11 


Q9QVM2 


Q9qvm2 mus sp. glu 


660 


3 


10 


.7 


33 


11 


P97670 


P97670 rattus norv 


661 


3 


10 


.7 


33 


11 


Q62539 


Q6253 9 mus spretus 


662 


3 


10 


. 7 


33 


12 


072982 


072982 hepatitis c 


663 


3 


10 


.7 


33 


12 


073068 


073068 hepatitis c 


664 


3 


10 


.7 


33 


12 


072979 


072979 hepatitis c 


665 


3 


10 


. 7 


33 


12 


072988 


072988 hepatitis c 


666 


3 


10 


. 7 


33 


12 


072992 


072992 hepatitis c 


667 


3 


10 


.7 


33 


12 


072990 


072990 hepatitis c 


668 


3 


10 


.7 


33 


12 


073010 


073010 hepatitis c 


669 


3 


10 


.7 


33 


12 


072981 


072981 hepatitis c 


670 


3 


10 


.7 


33 


12 


009759 


009759 lettuce mos 


671 


3 


10 


.7 


33 


12 


Q65687 


Q65687 bovine papi 


672 


3 


10 


.7 


33 


12 


073008 


073008 hepatitis c 


673 


3 


10 


. 7 


33 


12 


072986 


072986 hepatitis c 


674 


3 


10 


.7 


33 


12 


072993 


072993 hepatitis c 


675 


3 


10 


.7 


33 


12 


072984 


072984 hepatitis c 


676 


3 


10 


. 7 


33 


12 


073005 


073005 hepatitis c 


677 


3 


10 


.7 


33 


12 


073067 


073067 hepatitis c 


678 


3 


10 


.7 


33 


12 


072985 


072985 hepatitis c 


679 


3 


10 


.7 


33 


12 


Q9PXQ9 


Q9pxq9 tobacco mos 


680 


3 


10 


.7 


33 


12 


072983 


072983 hepatitis c 


681 


3 


10 


.7 


33 


12 


072987 


072987 hepatitis c 


682 


3 


10 


.7 


33 


12 


Q69461 


Q69461 human herpe 


683 


3 


10 


.7 


33 


12 


072978 


072978 hepatitis c 


684 


3 


10 


.7 


33 


12 


073009 


073009 hepatitis c 



685 


3 


10 , 


7 


33 


13 


Q9DD87 


Q9dcl87 brachyramph 


686 


3 


10 . 


7 


33 


13 


Q9PT34 


Q9pt34 oncorhynchu 


687 


3 


10 . 


7 


33 


13 


Q8UUR8 


Q8uur8 brachydanio 


688 


3 


10 . 


7 


33 


13 


09DD86 


Q9dd86 brachyramph 


689 


3 


10 . 


7 


33 


13 


Q9W7N9 


Q9w7n9 morone saxa 


690 


3 


10 . 


7 


33 


15 


Q9DZ98 


Q9dz98 human immun 


691 


3 


10 . 


7 


33 


15 


Q9QKI9 


Q9qki9 human immun 


692 


3 


10 . 


7 


33 


15 


Q8QDU0 


Q8qdu0 human immun 


693 


3 


10 . 


7 


33 


15 


Q70443 


Q70443 human immun 


694 


3 


10 . 


7 


33 


15 


Q8QDT9 


Q8qdt9 human immun 


695 


3 


10 . 


7 


33 


15 


Q9PXE1 


Q9pxel human immun 


696 


3 


10 . 


7 


33 


16 


025518 


025518 helicobacte 


697 


3 


10 . 


7 


33 


16 


050777 


050777 borrelia bu 


698 


3 


10 . 


7 


33 


16 


Q9KSG4 


Q9ksg4 vibrio chol 


699 


3 


10 . 


, 7 


33 


16 


Q9KRR7 


Q9krr7 vibrio chol 


700 


3 


10 . 


. 7 


33 


16 


09KOP4 


Q9kqp4 vibrio chol 


701 


3 


10 , 


, 7 


33 


16 


Q9JY44 


Q9jy44 neisseria m 


702 


3 


10 . 


. 7 


33 


16 


Q9JUM4 


Q9jum4 neisseria m 


703 


3 


10 . 


. 7 


33 


16 


Q9A388 


Q9a388 caulobacter 


704 


3 


10 . 


. 7 


33 


16 


Q8VK01 


QSvkOl mycobacteri 


705 


3 


10 . 


, 7 


33 


16 


Q8VJK1 


QSvjkl mycobacteri 


706 


3 


10 . 


, 7 


33 


16 


Q8NT95 


Q8nt95 corynebacte 


707 


3 


10 . 


, 1 


33 


16 


Q8NLP2 


Q8nlp2 corynebacte 


708 


3 


10 . 


. 7 


33 


16 


Q8KG99 


Q8kg99 chlorobium 


709 


3 


10 , 


. 7 


33 


16 


Q8KE00 


Q8ke00 chlorobium 


710 


3 


10 . 


. 7 


33 


16 


Q8KBZ0 


QSkbzO chlorobium 


711 


3 


10 , 


, 7 


33 


16 


Q8G1K3 


Q8glk3 brucella su 


712 


3 


10 , 


. 7 


33 


16 


Q8G173 


Q8gl73 brucella su 


713 


3 


10 , 


. 7 


33 


16 


Q8G0U8 


Q8g0u8 brucella su 


714 


3 


10 . 


. 7 


33 


16 


Q8FYD4 


Q8fyd4 brucella su 


715 


3 


10 . 


. 7 


33 


16 


Q8FSG0 


QSfsgO corynebacte 


716 


3 


10 , 


. 7 


33 


16 


Q8EHN3 


Q8ehn3 shewanella 


717 


3 


10 . 


. 7 


33 


16 


Q8EFM7 


Q8efm7 shewanella 


718 


3 


10 


. 7 


33 


16 


Q8EEJ9 


Q8eej9 shewanella 


719 


3 


10 . 


. 7 


33 


16 


Q8ECS9 


Q8ecs9 shewanella 


720 


3 


10 


. 7 


33 


16 


Q8E877 


Q8e877 shewanella 


721 


3 


10 


. 7 


33 


16 


Q8DWK4 


Q8dwk4 streptococc 


722 


3 


10 


. 7 


33 


16 


Q8CTR8 


Q8ctr8 staphylococ 


723 


3 


10 


. 7 


33 


17 


09HSX6 


Q9hsx6 halobacteri 


724 


3 


10 


. 7 


34 


1 


Q9P9L8 


Q9p918 methanosarc 


725 


3 


10 


. 7 


34 


2 


Q8KYQ3 


Q8kyq3 bacillus an 


726 


3 


10 


. 7 


34 


2 


Q02571 


Q02571 klebsiella 


727 


3 


10 


. 7 


34 


2 


Q8KJV6 


Q8kjv6 proteus vul 


728 


3 


10 


. 7 


34 


2 


Q50998 


Q50998 neisseria g 


729 


3 


10 


. 7 


34 


2 


Q43910 


Q43910 azospirillu 


730 


3 


10 


. 7 


34 


2 


Q51131 


Q51131 neisseria m 


731 


3 


10 


. 7 


34 


2 


Q8KYH2 


Q8kyh2 bacillus an 


732 


3 


10 


. 7 


34 


2 


Q9KWG7 


Q9kwg7 citrobacter 


733 


3 


10 


. 7 


34 


2 


Q53594 


Q53 594 streptomyce 


734 


3 


10 


.7 


34 


2 


Q9RZW6 


Q9rzw6 borrelia bu 
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. 7 


35 


10 


08GUW8 


08cniw8 ni ppa Tn^Ti^ 

\^ \J ^ Ll WW (J X d L I Ld X X d 


949 


3 


10 . 


. 7 


35 


10 


Q8GS04 


08crs04 airahidoDR 1 R 


950 


3 


10 . 


. 7 


35 


11 


063896 


063 8 96 mus rd nk- 


951 


3 


10 . 


. 7 


35 


11 


Q63397 


063'^97 "rattiiR nnrv 

^ U >J --J ^ 1 X d L« lXk3 1J.WX V 


952 


3 


10 . 


. 7 


35 


11 


055083 


ORRnA"^ miiQ TTinQr'nln 


953 


3 


10 , 


, 7 


35 


11 


Q8K1C8 


08klc8 TTiiis musculu 


954 


3 


10 . 


. 7 


35 


11 


Q60608 


060^08 mnc? mnRr'nlii 

\^ \J \J \J \J Ll Ll^$^ llll^O \^ 1-XX 


955 


3 


10 . 


, 7 


35 


11 




^^(^VJW X d U LL£3 . 


956 


3 


10 . 


, 7 


35 


11 


P97643 


Irj//0*±0 XdUL.U.0 litJiV 


957 


3 


10 . 


. 7 


35 


11 


n97?G'^ 


OQ'7'[?n''^ 7"a "1" i" n Q nnT^T' 

^^^^^O X d L. Llo IILJI. V 


958 


3 


10 . 


, 7 


35 


11 


O9R086 


091^08^ mnci nmicir'nln 


959 


3 


10 . 


, 7 


35 


11 


090VT2 


09rrvt"? mnR rti vpt" 

'j^-'^-^v L. ^ LIILXO • X C Lrf 


960 


3 


10 . 


. 7 


35 


11 


Q8CC11 


ORrT'l 1 TTrnQ rrn 1 Q r*n 1 n 

\^O^^XX ILILIO lllLlO LiX Li 


961 


3 


10 . 


, 7 


35 


12 


Q9YR05 


09vrOS "nnrpinp hit 

^ X W ^^V^X XA A^ V_. X X 


962 


3 


10. 


,7 


35 


12 


Q9YQW9 


Q9yqw9 porcine cir 


963 


3 


10. 


,7 


35 


12 


091864 


091864 porcine cir 


964 


3 


10. 


.7 


35 


12 


Q91HA8 


Q91ha8 porcine cir 


965 


3 


10. 


,7 


35 


12 


093206 


093206 porcine cir 


966 


3 


10. 


, 7 


35 


12 


056131 


056131 porcine cir 


967 


3 


10. 


, 7 


35 


12 


Q8B6X7 


Q8b6x7 human rotav 


968 


3 


10. 


7 


35 


13 


Q9PU87 


Q9pu87 catla catla 


969 


3 


10. 


7 


35 


13 


Q09132 


Q09132 xenopus lae 



970 


3 


10. 


7 


35 


13 


Q09133 


Q09133 


xenopus lae 


971 


3 


10 . 


7 


35 


15 


Q70411 


Q70411 


human 


immun 


972 


3 


10 . 


7 


35 


15 


Q76748 


Q76748 


human 


immun 


973 


3 


10 . 


7 


35 


15 


Q80458 


Q80458 


human 


immun 


974 


3 


10 . 


7 


35 


15 


091391 


091391 


human 


immun 


975 


3 


10. 


7 


35 


15 


Q76728 


Q76728 


human 


immun 


976 


3 


10- 


7 


35 


15 


040565 


040565 


human 


immun 


311 


3 


10. 


7 


35 


15 


Q79885 


Q79885 


human 


immun 


978 


3 


10, 


7 


35 


15 


Q72954 


Q72954 


human 


immun 


979 


3 


10- 


7 


35 


15 


071540 


071540 


human 


immun 


980 


3 


10. 


,7 


35 


15 


040564 


040564 


human 


immun 


981 


3 


10. 


,7 


35 


15 


Q78812 


Q78812 


human 


immun 


982 


3 


10- 


,7 


35 


15 


071534 


071534 


human 


immun 


983 


3 


10. 


.7 


35 


15 


Q9QR10 


Q9qrl0 


human 


immun 


984 


3 


10. 


.7 


35 


15 


Q80457 


Q80457 


human 


immun 


985 


3 


10. 


.7 


35 


15 


040563 


040563 


human 


immun 


986 


3 


10. 


,1 


35 


15 


Q80609 


Q80609 


human 


immun 


987 


3 


10, 


,1 


35 


15 


Q70215 


Q70215 


human 


immun 


988 


3 


10, 


.7 


35 


15 


Q79494 


Q79494 


human 


immun 


989 


3 


10 


.7 


35 


15 


091371 


091371 


human 


immun 


990 


3 


10 


.7 


35 


15 


Q9IPY8 


Q9ipY8 


human 


immun 


991 


3 


10 


.7 


35 


15 


Q78049 


Q78049 


human 


immun 


992 


3 


10 


-7 


35 


15 


Q79477 


Q79477 


human 


immun 


993 


3 


10 


.7 


35 


15 


Q77703 


Q77703 


human 


immun 


994 


3 


10 


.7 


35 


15 


091436 


091436 


human 


immun 


995 


3 


10 


.7 


35 


15 


Q70322 


Q70322 


human 


immun 


996 


3 


10 


.7 


35 


15 


Q9IPX8 


Q9ipx8 


human 


immun 


997 


3 


10 


.7 


35 


15 


091441 


091441 


human 


immun 


998 


3 


10 


,7 


35 


15 


Q79478 


Q79478 


human 


immun 


999 


3 


10 


.7 


35 


15 


Q79481 


Q79481 


human 


immun 


1000 


3 


10 


.7 


35 


15 


Q79997 


Q79997 


human 


immun 



ALIGNMENTS 



RESULT 1 
Q91Y90 

ID Q91Y90 PRELIMINARY; PRT; 31 AA. 

AC Q91Y90; 

DT Ol-DEC-2 001 (TrEMBLrel . 19, Created) 

DT Ol-DEC-2001 {TrEMBLrel. 19, Last sequence update) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Parathyroid hormone (Fragment) . 

GN PTH . 

OS Peromyscus maniculatus (Deer mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Sigmodontinae; 

OC Peromyscus. 

OX NCBI_TaxID=10042 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Prince K.L., Dewey M.J.; 

RL Submitted (MAY-2001) to the EMBL/GenBank/DDBJ databases . 

DR EMBL; AF382953; AAK63072.1; 

DR InterPro; IPR001415; Parathyrd_hrm. 

DR InterPro; IPR003625; Pthyrhorm_sub . 



DR Pfam; PF01279; Parathyroid; 1. 

DR ProDom; PD010687; Pthyrhorm_sub; 1, 

DR PROSITE; PS 0033 5; PARATHYROID; 1. 

FT NON_TER 1 1 

FT NON_TER 31 31 

SQ SEQUENCE 31 AA; 3461 MW; A208B0E772B9B55B CRC64; 



Query Match 17.9%; Score 5; DB 11; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



3 HNLGK 7 

Mill 
21 HNLGK 25 



RESULT 2 
Q91Y91 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OC 
OX 
RN 
RP 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
FT 
FT 
SQ 



Q91Y91 PRELIMINARY; PRT; 31 AA. 

Q91Y91; 

Ol-DEC-2001 (TrEMBLrel. 19, Created) 

Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

Parathyroid hormone (Fragment) . 

PTH. 

Peromyscus polionotus (Oldfield mouse) . 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Sigmodontinae; 



Eukaryota; Metazoa; Chordata; 
Mammalia ; Eutheria ; Rodent ia ; 
Peromyscus . 
NCBI_TaxID=42413; 
[1] 

SEQUENCE FROM N.A. 
Prince K.L,, Dewey M. J. ; 

Submitted (MAY-2001) to the EMBL/GenBank/DDBJ databases. 

EMBL; AF382952; AAK63071.1; 

Inter Pro ; IPR001415; Para thy rd_hrm . 

I nt er Pro ; I PRO 0 3 6 2 5 ; P thy rhorm_sub . 

Pfam; PF01279; Parathyroid; 1. 

ProDom; PD010687; Pthyrhorm_sub; 1. 

PROSITE; PS0033 5; PARATHYROID; 1. 

NON_TER 1 1 

NON_TER 31 31 

SEQUENCE 31 AA; 3461 MW; A208B0E772B9B55B CRC64 ; 



Query Match 17.9%; Score 5; DB 11; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 
Db 



3 HNLGK 7 

Mill 
21 HNLGK 25 



RESULT 3 
Q8KCG1 

ID Q8KCG1 PRELIMINARY; PRT; 31 AA. 

AC Q8KCG1; 



DT Ol-OCT-2002 (TrEMBLrel . 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Hypothetical protein CT1460. 

GN CT1460. 

OS Chlorobium tepidum, 

OC Bacteria; Chlorobi; Chlorobia; Chlorobiales ; Chlorobiaceae; 

OC Chlorobium. 

OX NCBI_TaxID=1097 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=TLS / ATCC 4 9652 / DSM 12 025; 

RX MEDLINE=22103685; PubMed-12093901 ; 

RA Eisen J. A., Nelson K.E., Paulsen I.T., Heidelberg J.F., Wu M. , 

RA Dodson R.J., Deboy R. , Gwinn M.L., Nelson W.C., Haft D.H. , 

RA Hickey E.K., Peterson J.D., Durkin A.S., Kolonay J.L., Yang F. , 

RA Holt I., Umayam L.A., Mason T. , Brenner M., Shea T.P., Parksey D. , 

RA Nierman W.C., Feldblyum T.V., Hansen C.L., Craven M.B., Radune D. , 

RA Vamathevan J., Khouri H. , White O. , Gruber T.M. , Ketchum K.A. , 

RA Venter J.C, Tettelin H. , Bryant D.A. , Fraser CM.; 

RT "The complete genome sequence of Chlorobium tepidum TLS, a 

RT photosynthetic, anaerobic, green- sulfur bacterium."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:9509-9514(2002). 

DR EMBL; AE012903; AAM72688.1; -. 

DR TIGR; CT1460; -. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 31 AA; 3539 MW; 0353C55E1E8 185CC CRC64 ; 

Query Match 17.9%; Score 5; DB 16; Length 31; 

Best Local Similarity 100.0%; Fred. No. 3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 18 LHHLI 22 

Mill 

Db 24 LHHLI 28 



RESULT 4 
Q8JAC6 

ID Q8JAC6 PRELIMINARY; PRT; 35 AA. 

AC Q8JAC6; 

DT Ol-OCT-2002 (TrEMBLrel. 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Gpl2 0 (Fragment) . 

GN ENV. 

OS Human immunodeficiency virus 1. 

OC Viruses; Retroid viruses; Retroviridae; Lentivirus. 

OX NCBI_TaxID=11676 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22068028; PubMed=12 072500 ; 

RA Kijak G.H., Simon V., Balfe P., Vanderhoeven J., Pampuro S.E., 

RA Zala C, Ochoa C. , Cahn P., Markowitz M. , Salomon H. ; 

RT "Origin of Human Immunodeficiency Virus Type 1 Quasispecies Emerging 

RT after Ant i retroviral Treatment Interruption in Patients with 

RT Therapeutic Failure."; 



RL J. Virol- 76:7000-7009(2002). 

DR EMBL; AF470757; AAM74604 . 1 ; 

DR InterPro; IPR000777; GP120. 

DR Pfam; PF00516; GP120; 1. 

KW AIDS; Coat protein; Glycoprotein. 

FT NON_TER 1 1 

FT NON_TER 35 35 

SQ SEQUENCE 35 AA; 3734 MW; DE5E0E3DEC9BE2B0 CRC64 ; 



Query Match 17.9%; Score 5; DB 15; Length 35; 

Best Local Similarity 100.0%; Pred. No. 3.4e+02; 
Matches 5; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



6 GKSIQ 10 

Mill 
9 GKSIQ 13 



RESULT 5 
Q51595 

ID Q51595 PRELIMINARY; PRT; 39 AA. 

AC Q51595; 

DT Ol-NOV-1996 (TrEMBLrel . 01, Created) 

DT Ol-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Acetoin:DCPIP oxidoreductase alpha and beta subunits, lipoat synthase, 

DE dihydrolipoamide dehydrogenase and ferredoxin (AcoA,B,C,S, and L) genes 

DE (Fragment) . 

OS Pelobacter carbinolicus . 

OC Bacteria; Proteobacteria ; Deltaproteobacteria ; Desulfuromonadales ; 

OC Pelobacteraceae; Pelobacter, 

OX NCBI_TaxID=19; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Gra Bd 1; 

RA Oppermann F.B., Steinbuechel A.; 

RT "Identification and molecular characterization of the aco genes 

RT encoding the Pelobacter carbinolicus acetoin dehydrogenase enzyme 

RT system."; 

RL Submitted (MAR-1996) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; L24124; AAA91874.1; 

FT NON_TER 1 1 

FT NON_TER 3 9 3 9 

SQ SEQUENCE 39 AA; 4256 MW; AE9D84F64CD1126A CRC64 ; 



Query Match 17.9%; Score 5; DB 2; Length 39; 

Best Local Similarity 100.0%; Pred. No. 3.7e+02; 
Matches 5; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



12 LRRRF 16 

Mill 

22 LRRRF 26 



RESULT 6 
Q8GZQ8 

ID Q8GZQ8 PRELIMINARY; PRT; 28 AA. 



AC Q8GZQ8 ; 

DT Ol-MAR-2003 (TrEMBLrel . 23, Created) 

DT Ol-MAR-2003 (TrEMBLrel . 23, Last sequence update) 

DT Ol-iyiAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE SNF-1 related kinase (Fragment) . 

GN BKIN12 . 

OS Hordeum vulgare var. distichum (Two-rowed barley) . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta ; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta ; Liliopsida; Poales; Poaceae; Pooideae; 

OC Triticeae; Hordeum. 

OX NCBI_TaxID=112509; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Igri ; 

RA Clark J.S.C., Dani M., Barker J.H.A., Halford N.G., Karp A. ; 

RT "Bkinl2 Promoter Variants - Examples of Functional Biodiversity?"; 

RL Submitted (NOV-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF448389; AAN76447.1; 

KW Kinase. 

FT NON_TER 28 28 

SQ SEQUENCE 28 AA; 2950 MW; 853EDC11F6BB2C6C CRC64 ; 



Query Match 14.3%; 
Best Local Similarity 100.0%, 
Matches 4; Conservative 

Qy 4 NLGK 7 

MM 

Db 18 NLGK 21 



Score 4; DB 10; Length 28; 
Pred. No. 3.2e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



0; 



RESULT 7 
Q67781 

ID Q67781 PRELIMINARY; PRT; 28 AA. 

AC Q67781; 

DT 01 -NOV- 1996 (TrEMBLrel. 01, Created) 

DT Ol-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last annotation update) 

DE PTP (Fragment) . 

GN PTP . 

OS Human adenovirus type 31. 

OC Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mast adenovirus 

OX NCBI_TaxID=1052 9; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC cat# VR-1109; 

RA Ma Y., Mathews M.B.; 

RT "The structure, function, and evolution of adenovirus VA RNA: a 

RT phylogenetic approach."; 

RL Submitted (MAR-1996) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; U52568; AAB00302.1; -. 

FT NON_TER 1 1 

FT NON_TER 28 28 

SQ SEQUENCE 28 AA; 3282 MW; 6601C5E66A98ACF6 CRC64 ; 



Query Match 14.3%; Score 4; DB 12; Length 28; 

Best Local Similarity 100.0%; Pred. No. 3.2e+03; 



Matches 



4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



8 SIQD 11 

MM 

1 SIQD 4 



RESULT 8 
Q67780 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OX 
RN 
RP 
RC 
RA 
RT 
RT 



RL 
DR 
FT 
FT 
SQ 



Created) 

Last sequence update) 
Last annotation update) 



Q67780 PRELIMINARY; PRT; 28 AA . 

Q67780; 

01 -NOV- 19 96 (TrEMBLrel. 01, 
01 -NOV- 19 96 (TrEMBLrel. 01, 
Ol-DEC-2001 (TrEMBLrel- 19, 
FTP (Fragment) . 
FTP. 

Human adenovirus type 18. 

Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Ma st adenovirus . 
NCBI_TaxID=10528; 
[1] 

SEQUENCE FROM N.A. 
STRAIN-ATCC cat# VR-1095; 
Ma Y., Mathews M.B.; 

"The structure, function, and evolution of adenovirus VA RNA: a 
phylogenetic approach."; 
Submitted (MAR-1996) to the EMBL/GenBank/DDBJ databases. 
EMBL; U52567; AAB00300.1; -. 
NON_TER 1 1 

NON_TER 28 28 

SEQUENCE 28 AA; 3301 MW; 92 1195F72E98B9FE CRC64 ; 



Query Match 14 .3%; 

Best Local Similarity 100.0^ 
Matches 4; Conservative 



Score 4; DB 12; Length 28; 
Fred. No. 3.2e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 

Db 



8 SIQD 11 

MM 
1 SIQD 4 



RESULT 9 
Q9R4S6 

ID Q9R4S6 PRELIMINARY; PRT; 29 AA. 

AC Q9R4S6; 

DT Ol-MAY-2000 (TrEMBLrel. 13, Created) 

DT Ol-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT Ol-JUN-2000 (TrEMBLrel. 14, Last annotation update) 

DE Oligopeptidase (Fragment) . 

OS Treponema denticola, 

OC Bacteria; Spirochaetes ; Spirochaetales ; Spirochaetaceae; Treponema. 

OX NCBI_TaxID=158 ; 

RN [1] 

RP SEQUENCE . 

RX MEDLINE=95168853; PubMed=7864624 ; 

RA Makinen K.K., Makinen P.L., Loesche W.J., Syed S.A.; 

RT "Purification and general properties of an oligopeptidase from 

RT Treponema denticola ATCC 35405--a human oral spirochete."; 



RL Arch. Biochem. Biophys. 316:689-698(1995). 

SQ SEQUENCE 29 AA; 3484 MW; 2F614C2387CCF68C CRC64 ; 



Query Match 14.3%; Score 4; DB 2; Length 29; 

Best Local Similarity 100.0%; Pred. No. 3.3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 22 lAEI 25 

MM 

Db 11 lAEI 14 



RESULT 10 
Q96PP3 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RA 
RA 
RA 
RA 
RT 
RT 
RT 
RL 
DR 
FT 
FT 
SQ 



( Fragment ) 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



Q96PP3 PRELIMINARY; PRT; 29 AA . 

Q96PP3 ; 

Ol-DEC-2001 (TrEMBLrel. 19, Created) 
Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 
Ol-DEC-2001 (TrEMBLrel. 19, Last annotation update) 
Lympho-epithelial Kazal type-related inhibitor LEKTI 
SPINK5. 

Homo sapiens (Human) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
NCBI_TaxID=9606; 
[1] 

SEQUENCE FROM N.A. 

Sprecher E. , Chavanas S., DiGiovanna J.J., Amin S., Nielsen K. , 
Prendiville J.S., Silverman R,, Esterly N.B., Spraker M.K., Guelig E. 
de Luna M.L., Williams M.L., Buehler B., Pfendner E., Bale S.J., 
Uitto J., Hovnanian A., Richard G. ; 

"The spectrum of pathogenic mutations in SPINK 5 in 19 families with 
Netherton syndrome - Implications for mutation detection and first 
case of prenatal diagnosis."; 
J. Invest. Dermatol. 0:0-0(2001). 
EMBL; AF295783; AAK97140.1; -. 
NON_TER 1 1 

NON_TER 2 9 2 9 

SEQUENCE 29 AA; 3449 MW; 



Query Match 14.3%; 
Best Local Similarity 100.0% 
Matches 4; Conservative 



9F31E2AD857EC1BB CRC64 ; 

Score 4; DB 4; Length 29; 
Pred. No. 3.3e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



4 NLGK 7 

MM 

14 NLGK 17 



RESULT 11 
P82235 

ID P82235 PRELIMINARY; 
AC P82235; 

DT Ol-MAY-2000 (TrEMBLrel. 
DT Ol-MAy-2000 (TrEMBLrel. 
DT Ol-MAY-2000 (TrEMBLrel. 
DE Brevinin-2TD. 



PRT; 



29 AA. 



13, Created) 

13, Last sequence update) 
13, Last annotation update) 



OS Rana tempo raria (European common frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Neobatrachia ; Ranoidea; Ranidae; Rana. 

OX NCBI_TaxID=84 07; 

RN [1] 

RP SEQUENCE . 

RC TISSUE=SKIN SECRETION; 

RX MEDLINE=99266278; PubMed=10333736 ; 

RA Simmaco M. , Mignogna G, , Barra D. ; 

RT "Antimicrobial peptides from amphibian skin: what do they tell us?"; 

RL Biopolymers 47:435-450(1998). 

CC -!- FUNCTION: SHOWS ANTIBACTERIAL ACTIVITY AGAINST REPRESENTATIVE 
CC GRAM-NEGATIVE AND GRAM-POSITIVE BACTERIAL SPECIES. 

CC -I- SUBCELLULAR LOCATION: SECRETED. 

CC -!- TISSUE SPECIFICITY: SKIN. 

CC -!- SIMILARITY: BELONGS THE BREVI N IN/ ESCULENT I N/GAEGURIN/RUGOS IN 
CC FAMILY. 

KW Amphibian skin; Antibiotic. 

FT DISULFID 23 29 BY SIMILARITY. 

SQ SEQUENCE 29 AA; 3234 MW; DDF8F76F48F1273C CRC64; 

Query Match 14.3%; Score 4; DB 13; Length 29; 

Best Local Similarity 100.0%; Pred. No. 3.3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 2 LHNL 5 

MM 

Db 18 LHNL 21 

RESULT 12 
Q8KYE4 

ID Q8KYE4 PRELIMINARY; PRT; 30 AA. 

AC Q8KYE4; 

DT Ol-OCT-2002 (TrEMBLrel . 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Hypothetical protein. 

GN BXB0050. 

OS Bacillus anthracis . 

OG Plasmid pX02 . 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=1392 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=A2012; 

RX MEDLINE=22061436; PubMed=12004 073 ; 

RA Read T.D. , Salzberg S.L., Pop M. , Shumway M., Umayam L. , Jiang L., 

RA Holtzapple E., Busch J.D., Smith K.L. , Schupp J.M. , Solomon D. , 

RA Keim P., Eraser CM.; 

RT "Comparative Genome Sequencing for Discovery of Novel Polymorphisms 

RT Bacillus anthracis."; 

RL Science 296:2028-2033(2002). 

DR EMBL; AE011191; AAM26208.1; 

KW Hypothetical protein; Plasmid. 

SQ SEQUENCE 30 AA; 3515 MW; B3352D9BB791AEDF CRC64 ; 



Query Match 14.3%; Score 4; DB 2; Length 30; 

Best Local Similarity 100.0%; Pred. No. 3-4e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 7 KSIQ 10 

Db 4 KSIQ 7 



RESULT 13 
Q9JMV3 

ID Q9JMV3 PRELIMINARY; PRT; 30 AA. 

AC Q9 JMV3 ; 

DT Ol-OCT-2000 (TrEMBLrel . 15, Created) 

DT Ol-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT Ol-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Luc if erase alpha -subun it (Fragment) . 

GN LUXA. 

OS Escherichia coli. 

OC Bacteria ; Proteobacteria ; Gammaproteobact eria ; Enterobacteriales ; 

OC Enterobacteriaceae; Escherichia. 

OX NCBI_TaxID=562 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=HB101; 

RA Lotz W., Bauer T.; 

RT "luxAB/kan-cassette for site-directed insertion mutagenesis and 

RT bacterial transcription studies."; 

RL Submitted (SEP-1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=HB101; 

RA Olsson 0., Koncz C. , Szalay A.; 

RT "The use of luxA gene ofthe bacterial luciferase operon as a reporter 

RT gene . " ; 

RL Mol. Gen. Genet. 215:1-9(1998). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN^HBlOl; 

RX MEDLINE=92114868; PubMed=1685011 ; 

RA EscherA., O'KaneD.J., Szalay A. ; 

RT "The beta subunit polypeptide of Vibrio harveyi luciferase determines 

RT light emission at 42 degrees C"; 

RL Mol. Gen. Genet. 230:385-393(1991). 

DR EMBL; AJ249443; CAB96206-1; -. 

DR. HSSP; P0774 0; ILUC. 

DR InterPro; IPR002103; Bac_lucif erase . 

DR Pfam; PF00296; bac_lucif erase; 1. 

FT NON_TER 30 3 0 

SQ SEQUENCE 30 AA; 3454 MW; 2FC87235BDBE72FD CRC64 ; 

Query Match 14.3%; Score 4; DB 2; Length 30; 

Best Local Similarity 100.0%; Pred. No. 3.4e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



4 NLGK 7 

MM 



Db 



26 NLGK 29 



RESULT 14 
Q32617 

ID Q32617 PRELIMINARY; PRT; 30 AA. 

AC Q32617; 

DT Ol-NOV-1996 (TrEMBLrel . 01, Created) 

DT Ol-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last annotation update) 

DE ORF3 0- 

OS Marchantia polymorpha (Liverwort) . 

OG Chloroplast . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta ; Marchantiophyta ; 

OC Marchantiopsida ; Marchant iidae; Marchantiales ; Marchantiineae; 

OC Marchant iaceae; Marchantia. 

OX NCBI_TaxID=3197; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Ohyama K. , Fukuzawa H. , Kohchi T. , Shirai H, , Sano T, , Sano S., 

RA Umesono K. , Shiki Y., Takeuchi M. , Chang Z., Aota S., Inokuchi H., 

RA Ozeki H. ; 

RT "Chloroplast gene organization deduced from complete sequence of 

RT liverwort Marchantia polymorpha chloroplast DNA."; 

RL Nature 322:572-574(1986). 

RN [2] 

RP SEQUENCE FROM N.A, 

RX MEDLINE=84247325; PubMed=673 92 92 ; 

RA Yamano Y. , Ohyama K. , Komano T. ; 

RT "Nucleotide sequences of chloroplast 5S ribosomal RNA from cell 

RT suspension cultures of the liverworts Marchantia polymorpha and 

RT Jungermannia subulata."; 

RL Nucleic Acids Res. 12:4621-4624(1984). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=85087956; PubMed=6393057 ; 

RA Umesono K. , Inokuchi H. , Ohyama K. , Ozeki H. ; 

RT "Nucleotide sequence of Marchantia polymorpha chloroplast DNA: a 

RT region possibly encoding three tRNAs and three proteins including a 

RT homologue of E. coli ribosomal protein S14."; 

RL Nucleic Acids Res. 12:9551-9565(1984). 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Fukuzawa H. , Uchida Y. , Yamano Y. , Ohyama K. , Komano T. ; 

RT "Molecular cloning of promoters functional in Escherichia coli from 

RT chloroplast DNA of a liverwort, Marchantia polymorpha."; 

RL Agric. Biol. Chem. 49:2725-2731(1985). 

RN [5] 

RP SEQUENCE FROM N.A. 

RA Yamano Y., Kohchi T., Fukuzawa H., Ohyama K. , Komano T, ; 

RT "Nucleotide sequences of chloroplast 4.5 S ribosomal RNA from a leafy 

RT liverwort, Jungermannia subulata, and a thai 1 old liverwort, Marchantia 

RT polymorpha . " ; 

RL FEBS Lett. 185:203-207(1985). 

RN [6] 

RP SEQUENCE FROM N.A. 

RA Fukuzawa H. , Kohchi T. , Shirai H. , Ohyama K. , Umesono K. , Inokuchi H., 



RA Ozeki H. ; 

RT "Coding sequences for chloroplast ribosomal protein S12 from the 

RT liverwort, Marchantia polymorpha, are separated far apart on the 

RT different DNA strands,"; 

RL FEES Lett. 198:11-15(1986). 

RN [7] 

RP SEQUENCE FROM N.A. 

RA Fukuzawa H. , Yoshida T. , Kohchi T., Okumura T. , Sawano Y., Ohyama K. ; 

RT "Splicing of group II introns in mRNAs coding for cytochrome b6 and 

RT subunit IV in liverwort Marchantia polymorpha chloroplast genome: Exon 

RT specifying a region coding for two genes with the spacer region.";, 

RL FEES Lett. 220:61-66(1987). 

RN [8] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89028845; PubMed=284618 9 ; 

RA Kohchi T. , Ogura Y. , Umesono K. , Yamada Y. , Komano T. , Ozeki H. , 

RA Ohyama K. ; 

RT "Ordered processing and splicing in a polycistronic transcript in 

RT liverwort chloroplasts . " ; 

RL Curr. Genet. 14:147-154(1988). 

RN [9] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=890686B7; PubMed=3 199436 ; 

RA Fukuzawa H. , Kohchi T. , Sano T. , Shirai H. , Umesono K. , Inokuchi H. , 

RA Ozeki H. , Ohyama K. ; 

RT "Structure and organization of Marchantia polymorpha chloroplast 

RT genome. III. Gene organization of the large single copy region from 

RT rbcL to trnl(CAU) ."; 

RL J. Mol. Biol. 203:333-351(1988). 

RN [10] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91212240; PubMed=17084 98 ; 

RA Shimada H. , Sugiura M. ; 

RT "Fine structural features of the chloroplast genome: comparison of the 

RT sequenced chloroplast genomes."; 

RL Nucleic Acids Res. 19:983-995(1991). 

RN [11] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89068685; PubMed=2462 054 ; 

RA Ohyama K. , Fukuzawa H. , Kohchi T. , Sano T, , Sano S., Shirai H. , 

RA Umesono K. , Shiki Y,, Takeuchi M. , Chang Z., Acta S.i., Inokuchi H. , 

RA Ozeki H. ; 

RT "Structure and organization of Marchantia polymorpha Chloroplast 

RT genome. I. cloning and gene identification."; 

RL J. Mol. Biol. 203:281-298(1988). 
RN [12] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89068686; PubMed=2974 085 ; 

RA Umesono K. , Inokuchi H. , Shiki Y. , Takeuchi M. , Chang Z., Fukuzawa H., 

RA Kohchi T. , Shirai H. , Ohyama K. , Ozeki H. ; 

RT "Structure and organization of Marchantia polymorpha chloroplast 

RT genome. II. gene organization of the large single copy region from 

RT rps ' 12 to atpB. " ; 

RL J. Mol. Biol. 203:299-331(1988). 
RN [13] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89068688; PubMed=3199437 ; 



RA Kohchi T., Shirai H. , Fukuzawa H. , Sano T. , Komano T. , Umesono K. , 

RA Inokuchi H., Ozeki H. , Ohyama K. ; 

RT "Structure and organization of Marchantia polymorpha chloroplast 

RT genome. IV. inverted repeat and small single copy regions ." ; 

RL J. Mol. Biol. 203:353-372(1988). 

DR EMBL; X04465; CAA28070.1; 

KW Chloroplast . 

SQ SEQUENCE 30 AA; 3842 MW; 41C170779F249D58 CRC64 ; 

Query Match 14,3%; Score 4; DB 8 ; Length 30; 

Best Local Similarity 100.0%; Pred. No. 3.4e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 15 RFFL 18 

MM 

Db 24 RFFL 27 



RESULT 15 
Q8MXE8 

ID Q8MXE8 PRELIMINARY; PRT; 31 AA. 

AC Q8MXE8 ; 

DT Ol-OCT-2002 (TrEMBLrel . 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein K07A9.4. 

GN K07A9.4. 

OS Caenorhabditis elegans. 

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea ; 

OC Rhabditidae; Peloderinae; Caenorhabditis. 

OX NCBI_TaxID=6239 ; 

RN [1] 

RP SEQUENCE FROM M.A. 

RC STRAIN=Bristol N2 ; 

RX MEDLINE=99069613; PubMed=9851916 ; 

RA Waterston R. ; 

RT "Genome sequence of the nematode C. elegans: a platform for 

RT investigating biology. The C. elegans Sequencing Consortium."; 

RL Science 282:2012-2018(1998). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bristol N2 ; 

RA Davidson S., O'Neal D. ; 

RT "The sequence of C. elegans cosmid K07A9."; 

RL Submitted (OCT-1998) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bristol N2 ; 

RA Waterston R. ; 

RL Submitted (AUG-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF099924; AAM98005.1; 

DR WormPep; K07A9.4; CE3170 9. 

KW Hypothetical protein. 

SQ SEQUENCE 31 AA; 3720 MW; 147938913DC940ED CRC64 ; 



Query Match 14.3%; Score 4; DB 5; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 



Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 16 FFLH 19 

MM 

Db 21 FFLH 24 



RESULT 16 
Q9MS77 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OG 



(TrEMBLrel, 15, Created) 

(TrEMBLrel. 15, Last sequence update) 

(TrEMBLrel. 19, Last annotation update) 
I protein M. 



Q9MS77 PRELIMINARY; PRT; 31 AA. 

Q9MS77; 
Ol-OCT-2000 
Ol-OCT-2000 
Ol-DEC-2001 
Photosystem 
PSAM. 

Phacus acuminata . 
Chloroplast . 

OC Eukaryota; Euglenozoa; Euglenida; Euglenales; Phacus. 
OX NCBI_TaxID=130316; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21080550; PubMed=11212923 ; 

RA Doetsch N.A., Thompson M.D., Favreau M.R., Hallick R.B.; 
RT "Comparison of psbK operon organization and group III intron content 
RT in chloroplast genomes of 12 Euglenoid species."; 
RL Mol- Gen. Genet. 264:682-690(2001). 
DR EMBL; AF241276; AAF82438.1; 
KW Chloroplast. 
SQ SEQUENCE 31 AA; 3449 MW 



2FFB2AF4B4ACDEC8 CRC64; 



Query Match 14.3%; Score 4; DB 8; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



4 NLGK 7 

MM 
24 NLGK 27 



RESULT 17 
Q9DFI7 

ID Q9DFI7 PRELIMINARY; PRT; 31 AA. 

AC Q9DFI7; 

DT Ol-MAR-2001 (TrEMBLrel. 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Double-sex like protein Dmrt5 (Fragment) . 

OS Monopterus albus (swamp eel) . 

OC Eiikaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha; Synbranchif ormes ; 

OC Synbranchidae; Monopterus. 

OX NCBI_TaxID-43700; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Blood; 



RA Huang X . , Zhou R . ; 

RT "A conserved family of doublesex related genes from fish."; 

RL Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF272957; AAG18566.1; 

DR InterPro; IPR001275; DM_DNA-binding . 

DR Pfam; PF00751; DM-domain; 1. 

DR PROSITE; PS50809; DM_D0MAIN_2 ; 1. 

FT NON_TER 1 1 

FT NON_TER 31 31 

SQ SEQUENCE 31 AA; 3633 MW; 33A2E9B0B118DE2D CRC64 ; 



Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



Qy 
Db 



21 LIAE 24 

MM 
24 LIAE 27 



RESULT 18 
Q9DEW2 

ID Q9DEW2 PRELIMINARY; PRT; 31 AA. 

AC Q9DEW2; 

DT Ol-MAR-2001 (TrEMBLrel . 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Doublesex-related protein Dmrtl5 (Fragment) . 

GN DMRT15. 

OS Coturnix coturnix (Common quail) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Gall i formes; Phasianidae; Phasianinae 

OC Coturnix. 

OX NCBI_TaxID-9091 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Blood; 

RA Ren L . , Zhou R . ; 

RT "Evolutionary conseirvation of doublesex-related gene family in 

RT amphibian, reptile and bird."; 

RL Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY005392; AAG15564.1; -. 

DR InterPro; IPR001275; DM_DNA-binding . 

DR Pfam; PF00751; DM-domain; 1. 

DR PROSITE; PS50809; DM_D0MAIN_2; 1. 

FT N0N_TER 1 1 

FT N0N_TER 31 31 

SQ SEQUENCE 31 AA; 3593 MW; 211519B0B7B3D92D CRC64; 

Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 21 LIAE 24 

MM 

Db 24 LIAE 27 



RESULT 
Q9DFH7 



19 



ID 
AC 
DT 
DT 
DT 
DE 
OS 
OC 
OC 
OC 
OC 
OX 
RN 
RP 
RA 
RT 
RL 
DR 
DR 
DR 
DR 
FT 
FT 
SQ 



Q9DFH7 PRELIMINARY; PRT; 31 AA. 

Q9DFH7 ; 

Ol-MAR-2001 (TrEMBLrel . 
Ol-MAR-2001 (TrEMBLrel . 
Ol-MAR-2003 (TrEMBLrel , 
Doublesex-like protein 
Mastacembelus aculeatus . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Actinopterygii; Neopterygii; Teleostei; Euteleostei, 



16, Created) 

16, Last sequence update) 
23, Last annotation update) 
DmrtG (Fragment) , 



Euteleostomi ; 
Neoteleostei ; 



Acanthomorpha; Acanthopterygii ; Percomorpha; Synbranchi formes ; 
Mastacembelidae; Mastacembelus . 
NCBI_TaxID=138129; 
[1] 

SEQUENCE FROM N.A. 
Huang X . , Zhou R . ; 

"A conserved family of doublesex related genes from fish."; 
Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases, 
EMBL; AF272971; AAG18556.1; -. 
InterPro; IPR001275 ; DM_DNA-binding . 
Pfam; PF00751; DM-domain; 1. 
PROSITE; PS50809; DM_D0MAIN_2 ; 1. 
NON_TER 1 1 

NON_TER 31 31 

SEQUENCE 31 AA; 3653 MW; 2 11519B0A76 9B92D CRC64; 



Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



21 LIAE 24 

MM 

24 LIAE 27 



RESULT 20 
Q9DFI2 

ID Q9DFI2 PRELIMINARY; PRT; 31 AA. 

AC Q9DFI2; 

DT Ol-MAR-2001 (TrEMBLrel. 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Doublesex-like protein Dmrt6 (Fragment) . 

OS Brachydanio rerio (Zebrafish) (Danio rerio) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X . , Zhou R . ; 

RT "A conserved family of doublesex related genes from fish."; 

RL Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF272962; AAG18558.1; 

DR InterPro; IPR001275; DM_DNA-binding . 



DR Pfam; PF00751; DM-domain; 1. 

DR PROSITE; PS50809; DM_D0MAIN_2 ; 1. 

FT NON_TER 1 1 

FT NON_TER 3131 

SQ SEQUENCE 31 AA; 3653 MW; 2 11519B0A769B92D CRC64 ; 

Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3-5e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 21 LIAE 24 

MM 

Db 24 LIAE 27 



RESULT 21 
Q9DFH8 

ID Q9DFH8 PRELIMINARY; PRT; 31 AA. 

AC Q9DFH8 ; 

DT Ol-MAR-2001 (TrEMBLrel . 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Doublesex-like protein DmrtS (Fragment) . 

OS Mastacembelus aculeatus. 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopt erygi i ; Percomorpha ; Synbranchi formes ; 

OC Mastacembelidae; Mastacembelus. 

OX NCBI_TaxID=13812 9; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X . , Zhou R . ; 

RT "A conserved family of doublesex related genes from fish."; 

RL Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF272970; AAG18555-1; -. 

DR InterPro; IPR001275; DM_DNA-binding . 

DR Pfam; PF00751; DM-domain; 1. 

DR PROSITE; PS50809; DM_D0MAIN_2; 1. 

FT NON_TER 1 1 

FT NON_TER 31 31 

SQ SEQUENCE 31 AA; 3633 MW; 33A2E9B0B1 18DE2D CRC64; 



Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 

Db 



21 LIAE 24 

MM 

24 LIAE 27 



RESULT 22 
Q9DFI6 

ID Q9DFI6 PRELIMINARY; PRT; 31 AA. 

AC Q9DFI6; 

DT Ol-MAR-2001 (TrEMBLrel. 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 



DT Ol-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE Double-sex like protein Dmrt6 (Fragment) . 

OS Monopterus albus (swamp eel) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; Synbranchif ormes ; 

OC Synbranchidae; Monopterus. 

OX NCBI_TaxID=43700; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Blood; 

RA Huang X . , Zhou R . ; 

RT "A conserved family of doublesex related genes from fish."; 

RL Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF272958; AAG18567.1; 

DR InterPro; IPR001275; DM_DNA-binding . 

DR Pfam; PF00751; DM-domain; 1. 

DR PROSITE; PS50809; DM_D0MAIN_2 ; 1. 

FT NON_TER 1 1 

FT NON_TER 31 31 

SQ SEQUENCE 31 AA; 3653 MW; 211519B0A769B92D CRC64 ; 

Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 21 LIAE 24 

MM 

Db 24 LIAE 27 



RESULT 23 
Q9DFI1 

ID Q9DFI1 PRELIMINARY; PRT; 31 AA. 

AC Q9DFI1; 

DT Ol-MAR-2001 (TrEMBLrel. 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Doublesex-like protein Dmrtll (Fragment) . 

OS Brachydanio rerio (Zebrafish) (Danio rerio) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X . , Zhou R . ; 

RT "A conserved family of doublesex related genes from fish."; 

RL Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF272963; AAG18559.1; 

DR InterPro; IPR001275; DM_DNA-binding . 

DR Pfam; PF00751; DM-domain; 1. 

FT NON_TER 1 1 

FT NON_TER 31 31 

SQ SEQUENCE 31 AA; 3734 MW; 2 1151 9BE401 9B92D CRC64; 



Query Match 



14.3%; Score 4; DB 13; Length 31; 



Best Local Similarity 100.0%; Pred. No. 3.5e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 21 LIAE 24 

MM 

Db 24 LIAE 27 



RESULT 24 
Q9DEW1 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OC 
OX 
RN 
RP 
RC 
RA 
RT 
RT 
RL 
DR 
DR 
DR 
FT 
FT 
SQ 



Q9DEW1 PRELIMINARY; PRT; 31 AA. 

Q9DEW1; 

Ol-MAR-2001 (TrEMBLrel. 16, Created) 

Ol-MAR-2001 (TrEMBLrel- 16, Last sequence update) 

Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

Doublesex-related protein Dmrtl6 (Fragment) . 

DMRT16. 

Coturnix coturnix (Common quail) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae 
Coturnix. 
NCBI_TaxID=9091; 
[1] 

SEQUENCE FROM N.A. 
TISSUE=Blood; 
Ren L . , Zhou R . ; 

"Evolutionary conservation of doublesex-related gene family in 
amphibian, reptile and bird."; 

Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases. 

EMBL; AY0053 93; AAG15565.1; 

InterPro; IPR001275; DM_DNA-binding . 

Pfam; PF00751; DM-domaiu; 1. 

NON_TER 1 - 1 

NON_TER 31 31 

SEQUENCE 31 AA; 3525 MW; 4FC922DE73F934FB CRC64 ; 



Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



Qy 
Db 



21 LIAE 24 
24 LIAE 27 



RESULT 25 
Q9DEW3 

ID Q9DEW3 PRELIMINARY; PRT; 31 AA. 

AC Q9DEW3 ; 

DT Ol-MAR-2001 (TrEMBLrel. 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Doublesex-related protein Dmrt6 [Fragment) , 

GN DMRT6 . 

OS Coturnix coturnix (Common quail) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae 



OC Coturnix. 

OX NCBI_TaxID=9091 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Blood; 

RA Ren L . , Zhou R . ; 

RT "Evolutionary conservation of doublesex-related gene family in 

RT amphibian, reptile and bird."; 

RL Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY005391; AAG15563.1; -. 

DR InterPro; IPR001275; DM_DNA-binding . 

DR Pfam; PF00751; DM-domain; 1. 

DR PROSITE; PS50809; DM_D0MAIN_2; 1. 

FT N0N_TER 1 1 

FT N0N_TER 31 31 

SQ SEQUENCE 31 AA; 3653 MW; 2 11519B0A76 9B92D CRC64; 



Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 21 LIAE 24 

MM 

Db 24 LIAE 27 



RESULT 26 
Q90Z24 

ID Q90Z24 PRELIMINARY; PRT; 31 AA. 

AC Q90Z24; 

DT Ol-DEC-2001 (TrEMBLrel. 19, Created) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Dmrtl (Fragment) . 

OS Monopterus albus (swamp eel) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha; Synbranchif ormes ; 

OC Synbranchidae; Monopterus. 

OX NCBI__TaxI D=4 3 700; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhou R . , Huang X . ; 

RT "Molecular cloning and expression of rice field eel Dmrtl."; 

RL Submitted (MAY-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF378149; AAK59253.1; -. 

DR InterPro; IPR001275; DM_DNA-binding . 

DR Pfam; PF00751; DM-domain; 1. 

DR PROSITE; PS50809; DM_D0MAIN_2; 1. 

FT NON_TER 1 1 

FT NON_TER 31 31 

SQ SEQUENCE 31 AA; 3753 MW; BB91FDB73A1F0ADD CRC64 ; 



Query Match 14.3%; Score 4; DB 13; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 21 LIAE 24 

MM 

Db 24 LIAE 27 



RESULT 27 
Q8KEV8 

ID Q8KEV8 PRELIMINARY; PRT; 31 AA. 

AC Q8KEV8; 

DT Ol-OCT-2002 (TrEMBLrel . 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Hypothetical protein CT0574 . 

GN CT0574 . 

OS Chlorobium tepidum. 

OC Bacteria; Chlorobi; Chlorobia; Chlorobiales ; Chlorobiaceae; 

OC Chlorobium. 

OX NCBI_TaxID=1097; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=TLS / ATCC 49652 / DSM 12025; 

RX MEDLINE=22103685; PubMed=12093901 ; 

RA Eisen J.A. , Nelson K.E., Paulsen I.T., Heidelberg J.F,, Wu M., 

RA Dodson R.J., Deboy R. , Gwinn M.L., Nelson W.C., Haft D.H., 

RA Hickey E.K., Peterson J.D., Durkin A.S., Kolonay J.L., Yang P., 

RA Holt I., Umayam L.A,, Mason T. , Brenner M,, Shea T.P., Parksey D,, 

RA Nierman W.C., Feldblyum T.V., Hansen C.L., Craven M.B., Radune D., 

RA Vamathevan J., Khouri H. , White 0., Gruber T.M., Ketchum K.A. , 

RA Venter J.C., Tettelin H., Bryant D.A., Fraser CM.; 

RT "The complete genome sequence of Chlorobium tepidum TLS, a 

RT photosynthetic, anaerobic, green-sulfur bacterium."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:9509-9514(2002). 

DR EMBL; AE012832; AAM71816,1; -. 

DR TIGR; CT0574; -. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 31 AA; 3734 MW; 588CF6AC05DA44 0B CRC64 ; 

Query Match 14.3%; Score 4; DB 16; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 21 LIAE 24 

MM 

Db 14 LIAE 17 



RESULT 28 
Q8KEV3 

ID Q8KEV3 PRELIMINARY; PRT; 31 AA. 

AC Q8KEV3; 

DT Ol-OCT-2002 (TrEMBLrel. 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Hypothetical protein CT0579. 

GN CT0579. 

OS Chlorobium tepidum. 

OC Bacteria; Chlorobi; Chlorobia; Chlorobiales; Chlorobiaceae; 



OC Chlorobium. 

OX NCBI_TaxID=1097; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=TLS / ATCC 49652 / DSM 12025; 

RX MEDLINE=22103685; PubMed=12 093901 ; 

RA Eisen J.A. , Nelson K.E., Paulsen I.T., Heidelberg J,F., Wu M. , 

RA Dodson R.J,, Deboy R. , Gwinn M.L., Nelson W,C., Haft D.H., 

RA Hickey E.K., Peterson J.D., Durkin A.S., Kolonay J.L., Yang F., 

RA Holt Umayam L.A. , Mason T. , Brenner M. , Shea T.P., Parksey D., 

RA Nierman W.C, Feldblyum T.V. , Hansen C.L. , Craven M.B., Radune D. , 

RA Vamathevan J., Khouri H. , White 0., Gruber T.M., Ketchum K.A. , 

RA Venter J.C., Tettelin H. , Bryant D.A. , Fraser CM.; 

RT "The complete genome sequence of Chlorobium tepidum TLS, a 

RT photosynthetic, anaerobic, green-sulfur bacterium."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:9509-9514(2002). 

DR EMBL; AE012832; AAM71821.1; -. 

DR TIGR; CT0579; 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 31 AA; 3617 MW; 265C9621F2F40044 CRC64; 

Query Match 14.3%; Score 4; DB 16; Length 31; 

Best Local Similarity 100.0%; Pred. No. 3.5e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 14 RRFF 17 

MM 

Db 4 RRFF 7 



RESULT 2 9 
Q96Q53 

ID Q96Q53 PRELIMINARY; PRT; 32 AA. 

AC Q96Q53; 

DT Ol-DEC-2001 (TrEMBLrel . 19, Created) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last annotation update) 

DE Mitochondrial ribosomal protein L17 (Fragment) . 

GN MRPL17. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21429115; PubMed=11543634 ; 

RA Kenmochi N. , Suzuki T. , Uechi T. , Magoori M. , Kuniba M. , Higa S., 

RA Watanabe K. , Tanaka T. ; 

RT "The human mitochondrial ribosomal protein genes: Mapping of 54 genes 

RT to the chromosomes and implications for human disorders."; 

RL Genomics 77:65-70(2001). 

DR EMBL; AB051620; BAB54948.1; -- 

KW Ribosomal protein. 

FT NON_TER 1 1 

SQ SEQUENCE 32 AA; 3542 MW; 1322815AE11A6404 CRC64; 



Query Match 14.3%; Score 4; DB 4; Length 32; 



Best Local Similarity 100.0%; Pred. No. 3.6e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 10 QDLR 13 

INI 

Db 11 QDLR 14 



RESULT 30 
Q9DZ37 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OX 
RN 
RP 
RX 
RA 
RA 
RA 
RA 
RT 
RT 
RL 
DR 
KW 
FT 
FT 
SQ 



Created) 

Last sequence update) 
Last annotation update) 



Q9DZ37 PRELIMINARY; PRT; 32 AA. 

Q9DZ37; 

Ol-MAR-2001 (TrEMBLrel. 16, 
Ol-MAR-2001 {TrEMBLrel. 16, 
Ol-OCT-2002 (TrEMBLrel. 22, 
Pol polyprotein (Fragment) . 
POL. 

Human immunodeficiency virus 1. 

Viruses; Retroid viruses; Retroviridae; Lentivirus. 
NCBI_TaxID=11676 ; 
[1] 

SEQUENCE FROM N.A. 

MEDLINE=20461476 ; PubMed=11005867 ; 

Martinez-Picado J., DePasquale M.P., Kartsonis N. , Hanna G.J., 
Wong J., Finzi D., Rosenberg E., Gunthard H.F., Sutton L. , Savara A. 
Petropoulos C.J-, Hellmann N., Walker B.D., Richman D.D. , 
Siliciano R. , D'Aquila R.T.; 

"Antiretroviral resistance during successful therapy of HIV type 1 
infection. " ; 

Proc. Natl. Acad. Sci. U.S.A. 97:10948-10953(2000). 
EMBL; AF292795; AAG25401.1; 
Polyprotein. 
NON_TER 1 1 

NON_TER 32 32 

SEQUENCE 32 AA; 3951 MW; B6AD13 060D1EEA1A CRC64 ; 



Query Match 14.3%; 
Best Local Similarity 100.0% 
Matches 4; Conservative 



Score 4; DB 15; Length 32; 
Pred. No. 3.6e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



15 RFFL 18 

MM 
8 RFFL 11 



RESULT 31 
Q9P1T8 

ID Q9P1T8 PRELIMINARY; PRT; 33 AA. 

AC Q9P1T8; 

DT Ol-OCT-2000 (TrEMBLrel. 15, Created) 

DT Ol-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT Ol-JUN-2001 (TrEMBLrel. 17, Last annotation update) 

DE Mutant chemokine receptor CCR5 (Fragment) . 

GN CCR5 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



ox NCB I_TaxI D= 9 6 0 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99392755; PubMed=10465086 ; 

RA Aikhionbare F.O., Newman C, Womack C. , Roth W.W., Stringer H.G. Jr., 

RA Bond V,C. ; 

RT "Characterization of a third CCR5 amplicon from CCR5-delta32- 

RT heterozygous HIV-l-inf ected individuals."; 

RL AIDS 13:1585-1586(1999). 

DR EMBL; AF052244; AAF65577.1; 

KW Receptor. 

FT NON_TER 1 1 

SQ SEQUENCE 33 AA; 3477 MW; 664294782E2DB3E1 CRC64 ; 

Query Match 14.3%; Score 4; DB 4; Length 33; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 5 LGKS 8 

MM 

Db 3 0 LGKS 33 



RESULT 32 
Q8N4J9 

ID Q8N4J9 PRELIMINARY; PRT; 33 AA. 

AC Q8N4J9; 

DT Ol-OCT-2002 (TrEMBLrel . 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Hypothetical protein (Fragment) . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Strausberg R.; 

RL Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC033964; AAH33964.1; 

KW Hypothetical protein. 

FT NON_TER 1 1 

SQ SEQUENCE 33 AA; 3884 MW; 1A0E9E343 07 0B4E1 CRC64; 



Query Match 14.3%; Score 4; DB 4; Length 33; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 

Db 



1 LLHN 4 

MM 

17 LLHN 20 



RESULT 33 
Q9AR83 

ID Q9AR83 PRELIMINARY; PRT; 33 AA. 



AC Q9AR83; 

DT Ol-JUN-2001 (TrEMBLrel. 17, Created) 

DT Ol-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Putative metallothionein-like protein (Fragment) . 

OS Pinus pinaster (Maritime pine) . 

OC Eukaryota; Viridiplantae; Streptophyta ; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Conif eropsida ; Coniferales; Pinaceae; Pinus. 

OX NCBI_TaxID=71647 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Seedling aerial parts; 

RA Dubos C, Le Provost G. , Salin F., Lalane C, Madur D. , Frigerio J.M., 

RA Plomion C. ; 

RT "Identification and characterization of water-deficit responsive genes 

RT in Maritime pine (Pinus pinaster Ait.) seedlings."; 

RL Submitted (APR-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AJ309103; CAC35976.1; 

FT NON_TER 1 1 

SQ SEQUENCE 33 AA; 3610 MW; A89612AE71CD728C CRC64; 

Query Match 14,3%; Score 4; DB 10; Length 33; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 20 HLIA 23 

MM 

Db 3 HLIA 6 



RESULT 34 
Q9RI14 

ID Q9RI14 PRELIMINARY; PRT; 33 AA. 

AC Q9RI14; 

DT Ol-MAY-2000 (TrEMBLrel. 13, Created) 

DT Ol-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT Ol-MAR-2002 (TrEMBLrel. 20, Last annotation update) 

DE YPCD1.70C protein (Fragment) . 

GN YPCD1.70C- 

OS Yersinia pestis, 

00 Plasmid pCDl . 

OC Bacteria ; Proteobacteria ; Gammaproteobacteria ; Enterobacterial es ; 

OC Enterobacteriaceae; Yersinia. 

OX NCBI_TaxID=632; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CO-92 / Biovar Oriental is; PLASMID=pCDl ; 

RX MEDLINE=21470413; PubMed=1158 6360 ; 

RA Parkhill J., Wren B.W., Thomson N.R., Titball R.W. , Holden M.T.G. , 

RA Prentice M.B., Sebaihia M. , James K.D., Churcher C, Mungall K.L., 

RA Baker S,, Basham D., Bentley S.D., Brooks K. , Cerdeno-Tarraga A.M., 

RA Chillingworth T., Cronin A., Davies R.M. , Davis P., Dougan G., 

RA Feltwell T., Hamlin N., Holroyd S., Jagels K. , Karlyshev A, V. , 

RA Leather S., Moule S., Oyston P.C.F., Quail M. , Rutherford K. , 

RA Simmonds M. , Skelton J., Stevens K. , Whitehead S., Barrell B.G.; 

RT "Genome sequence of Yersinia pestis, the causative agent of plague."; 

RL Nature 413:523-527(2001). 



DR EMBL; AL117189; CAB54947.1; 

KVJ Plasmid; Complete proteome. 

FT N01Sf_TER 33 33 

SQ SEQUENCE 33 AA; 3882 MW; 



69B9646EC9FF34DE CRC64 ; 



Query Match 14.3%; 
Best Local Similarity 100.0% 
Matches 4; Conservative 



Score 4; DB 16; Length 33; 
Pred. No. 3.7e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



20 HLIA 23 

MM 

3 0 HLIA 33 



RESULT 35 
Q9R5U1 

ID Q9R5U1 PRELIMINARY; PRT; 34 AA. 

AC Q9R5U1 ; 

DT Ol-MAY-2000 (TrEMBLrel. 13, Created) 

DT Ol-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE PEB4-MAJ0R antigenic peptide (Fragment) . 

OS Campylobacter jejuni. 

OC Bacteria ; Proteobact eria ; Epsilonproteobacteria ; Campylobacterales ; 

OC Campy 1 oba c t e r a c ea e ; Campy 1 oba c t er . 

OX NCBI_TaxID=197; 

RN [1] 

RP SEQUENCE . 

RX MEDLINE-91358413 ; PubMed=18 85571 ; 

RA Pei Z.H., Ellison R.T.III., Blaser M. J. ; 

RT "Identification, purification, and characterization of major antigenic 

RT proteins of Campylobacter jejuni."; 

RL J. Biol. Chem. 266:16363-16369(1991). 

FT NON_TER 1 1 

FT NON_TER 34 34 

SQ SEQUENCE 34 AA; 3665 MW; 73BECEFA922507D2 CRC64; 

Query Match 14.3%; Score 4; DB 2; Length 34; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 6 GKSI 9 

MM 

Db 8 GKSI 11 



RESULT 36 
Q9RCC0 

ID Q9RCC0 PRELIMINARY; PRT; 34 AA. 

AC Q9RCC0; 

DT Ol-MAY-2000 (TrEMBLrel. 13, Created) 

DT Ol-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Ferrochelatase (EC 4.99.1.1) (Protoheme ferro-lyase) (Heme synthetase) 
DE (Fragment) . 
GN HEMH . 

OS Yersinia pseudotuberculosis (type 0:1b) . 



OC Bacteria ; Proteobacteria ; Gammaproteobacteria ; Enterobacteriales ; 

OC Enterobacteriaceae; Yersinia. 

OX NCBI_TaxID=109458; 

RN [1] 

RP SEQUENCE FROM N,A. 

RC STRAIN=pa3606; 

RX MEDLINE=20392461; PubMed=1093 1327 ; 

RA Skurnik M. , Peippo A., Ervela E.; 

RT "Characterization of the 0-antigen gene clusters of Yersinia 

RT pseudotuberculosis and the cryptic 0-antigen gene cluster of Yersinia 

RT pestis shows that the plague bacillus is most closely related to and 

RT has evolved from Y. pseudotuberculosis serotype 0:1b."; 

RL Mol. Microbiol. 37:316-330(2000). 

CC -I- FUNCTION: CATALYZES THE FERROUS INSERTION INTO PROTOPORPHYRIN IX 
CC (BY SIMILARITY) . 

CC CATALYTIC ACTIVITY: PROTOPORPHYRIN + FE(2+) = PROTOHEME + 2 H(+). 

CC -!- PATHWAY: PROTOHEME BIOSYNTHESIS; LAST STEP. 

CC -!- SIMILARITY: BELONGS TO THE FERROCHELATASE FAMILY. 

DR EMBL; AJ251712; CAB63288.1; -- 

DR InterPro; IPR001015; Ferrochelatase . 

DR Pfam; PF00762; Ferrochelatase; 1. 

KW Heme biosynthesis; Iron; Lyase; Porphyrin biosynthesis. 

FT NON_TER 1 1 

SQ SEQUENCE 34 AA; 3921 MW; 948552A9D8 0C2FE8 CRC64 ; 

Query Match 14.3%; Score 4; DB 2; Length 34; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 16 FFLH 19 

MM 

Db 1 FFLH 4 



RESULT 37 
Q9RCD1 

ID Q9RCD1 PRELIMINARY; PRT; 34 AA. 

AC Q9RCD1 ; 

DT Ol-MAY-2000 (TrEMBLrel . 13, Created) 

DT Ol-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Ferrochelatese (EC 4.99.1,1) (Ferrochelatase) (Protoheme ferro-lyase) 

DE (Heme synthetase) (Fragment) . 

GN HEMH . 

OS Yersinia pestis. 

OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; 

OC Enterobacteriaceae; Yersinia. 

OX NCBI_TaxID=632 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=EV76; 

RX MEDLINE=20392461; PubMed-1093 1327 ; 

RA Skurnik M. , Peippo A., Ervela E. ; 

RT "Characterization of the 0-antigen gene clusters of Yersinia 

RT pseudotuberculosis and the cryptic 0-antigen gene cluster of Yersinia 

RT pestis shows that the plague bacillus is most closely related to and 

RT has evolved from Y. pseudotuberculosis serotype 0:1b."; 



RL Mol. Microbiol. 37:316-330(2000). 

CC -!- FUNCTION: CATALYZES THE FERROUS INSERTION INTO PROTOPORPHYRIN IX 
CC (BY SIMILARITY) . 

CC -!- CATALYTIC ACTIVITY: PROTOPORPHYRIN + FE(2+) = PROTOHEME + 2 H(+). 

CC -!- PATHWAY: PROTOHEME BIOSYNTHESIS; LAST STEP. 

CC -!- SIMILARITY: BELONGS TO THE FERROCHELATASE FAMILY. 

DR EMBL; AJ251713; CAB63269.1; -. 

DR InterPro; IPR001015; Ferrochelatase . 

DR Pfam; PF00762; Ferrochelatase; 1. 

KW Heme biosynthesis; Iron; Lyase; Porphyrin biosynthesis. 

FT NON_TER 1 1 

SQ SEQUENCE 34 AA; 3921 MW; 948552A9D80C2FE8 CRC64 ; 



Query Match 14.3%; Score 4; DB 2; Length 34; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



16 FFLH 19 

MM 
1 FFLH 4 



RESULT 38 
Q95LA4 

ID Q95LA4 PRELIMINARY; PRT; 34 AA. 

AC Q95LA4; 

DT Ol-DEC-2001 (TrEMBLrel . 19, Created) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT Ol-MAR-2002 (TrEMBLrel. 20, Last annotation update) 

DE LIM homeodomain protein 3b (Fragment) . 

GN LHX3 . 

OS Macaca mulatta (Rhesus macaque) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9544 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21453325; PubMed=11470784 ; 

RA Sloop K.W., Dwyer C.J., Rhodes S.J.; 

RT "An isoform-specif ic inhibitory domain regulates the Lhx3 lim 

RT homeodomain factor holoprotein and the production of a functional 

RT alternate translation form."; 

RL J. Biol. Chem. 276:36311-36319(2001). 

DR EMBL; AF370449; AAL09574.1; -. 

PCW DNA-binding; Homeobox; Nuclear protein. 

FT NON_TER 34 34 

SQ SEQUENCE 34 AA; 3674 MW; D7A2F12 58F8E4 14A CRC64; 

Query Match 14.3%; Score 4; DB 6; Length 34; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 

Db 



11 DLRR 14 

MM 
28 DLRR 31 



RESULT 39 
Q95IiA6 

ID Q95LA6 PRELIMINARY; PRT; 34 AA. 

AC Q95LA6; 

DT Ol-DEC-2001 (TrEMBLrel . 19, Created) 

DT Ol-DEC-2001 {TrEMBLrel . 19, Last sequence update) 

DT Ol-MAR-2002 (TrEMBLrel. 20, Last annotation update) 

DE LIM homeodomain protein 3b (Fragment) , 

GN LHX3 . 

OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; Sus. 

OX NCBI_TaxID=9823 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21453325; PubMed=11470784 ; 

RA Sloop Dwyer C.J., Rhodes S.J.; 

RT "An isoform-specif ic inhibitory domain regulates the Lhx3 lim 

RT homeodomain factor holoprotein and the production of a functional 

RT alternate translation form."; 

RL J. Biol. Chem, 276:36311-36319(2001). 

DR EMBL; AF370445; AAL09570.1; 

KW DNA-binding; Homeobox; Nuclear protein. 

FT NON_TER 34 34 

SQ SEQUENCE 34 AA; 3748 MW; 7083C3258F8E4 14C CRC64 ; 

Query Match 14.3%; Score 4; DB 6; Length 34; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 11 DLRR 14 

MM 

Db 28 DLRR 31 



RESULT 4 0 
Q91ZX2 

ID Q91ZX2 PRELIMINARY; PRT; 34 AA. 

AC Q91ZX2; 

DT Ol-DEC-2001 (TrEMBLrel. 19, Created) 

DT Ol-DEC-2001 (TrEMBLrel . 19 , Last sequence update) 

DT Ol-MAR-2002 (TrEMBLrel. 20, Last annotation update) 

DE LIM homeodomain protein 3b (Fragment) . 

GN LHX3 . 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21453325; PubMed=11470784 ; 

RA Sloop K.W., Dwyer C. J. , Rhodes S.J.; 

RT "An isoform-specif ic inhibitory domain regulates the Lhx3 lim 

RT homeodomain factor holoprotein and the production of a functional 

RT alternate translation form."; 

RL J. Biol- Chem. 276:36311-36319(2001). 



DR EMBL; AF370450; AAL09575.1; 

KW DNA-binding; Homeobox; Nuclear protein. 

FT NON_TER 34 34 

SQ SEQUENCE 34 AA; 3746 MW; D7A2F215BC88212C CRC64; 

Query Match 14.3%; Score 4; DB 11; Length 34; 

Best Local Similarity 100.0%; Pred. No. 3.7e+03; 
Matches 4; Conservative 0; Mismatches 0; Indels 

Qy 11 DLRR 14 

MM 

Db 28 DLRR 31 



Search completed: January 14, 2004, 10:42:13 
Job time : 33.9346 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 
Word size : 



January 14, 2004, 10:28:19 ; Search time 4.62305 Seconds 

(without alignments) 
284.822 Million cell updates/sec 

US-09-843-221A-169 
28 

1 LLHNLGKSIQDLRRRFFLHHLIAEIHTA 28 
OLIGO 

Gapop 60.0 , Gapext 60.0 
127863 seqs, 47026705 residues 
0 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 2 8 
Maximum DB seq length: 4 0 

Post-processing: Listing first 1000 summaries 



1319 



Database 



SwissProt 41 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
ELH_THETS 

ID ELH_THETS STANDARD; PRT; 3 6 AA. 

AC P80594; 

DT Ol-OCT-1996 (Rel . 34, Created) 

DT Ol-OCT-1996 (Rel. 34, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE Egg- laying- like hormone (L-ELH) . 

OS Theromyzon tessulatum (Leech) . 

OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; 

OC Rhynchobdellida; Glossiphoniidae ; Theromyzon. 

OX NCBI_TaxID=13286; 

RN [1] 

RP SEQUENCE . 

RC TISSUE=CNS; 

RX MEDLINE=98047073; PubMed=9387880 ; 

RA Salzet M., Verger-Bocquet M. , Vandenbulcke F., van Minnen J.; 

RT "Leech egg -laying -like hormone: structure, neuronal distribution and 

RT phylogeny."; 

RL Brain Res. Mol . Brain Res. 49:211-221(1997). 

CC -!- FUNCTION: MAY BE INVOLVED IN LEECH REPRODUCTION. 

CC -!- TISSUE SPECIFICITY: SUPRA, SUBESOPHAGEAL GANGLIA AND SEGMENTAL 
CC GANGLIA OF THE VENTRAL NERVE CORD AND BRAIN. 

CC DEVELOPMENTAL STAGE: L-ELH GREATLY INCREASES BEFORE EGG-LAYING, 

CC WHILE IT STRONGLY DECREASES AFTER EGG-LAYING. 

CC -!- MASS SPECTROMETRY: MW-4172.29; METHOD=Electrospray . 

KW Hormone; Amidation; Neuropeptide. 

FT MOD_RES 36 36 AMIDATION. 

SQ SEQUENCE 36 AA; 4290 MW; 2633C4 0D6A15CECC CRC64 ; 

Query Match 17.9%; Score 5; DB 1; Length 36; 

Best Local Similarity 100.0%; Pred. No. 51; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 12 LRRRF 16 

Mill 

Db 29 LRRRF 33 



RESULT 2 
PSAM_EUGGR 

ID PSAM_EUGGR STANDARD; PRT; 31 AA. 

AC P31479; 

DT Ol-JUL-1993 (Rel . 26, Created) 

DT Ol-JUL-1993 (Rel. 26, Last sequence update) 

DT Ol-OCT-1996 (Rel. 34, Last annotation update) 

DE Photosystem I reaction centre subunit XII (PSI-M) , 

GN PSAM . 

OS Euglena gracilis. 

OG Chloroplast . 

OC Eukaryota; Euglenozoa; Euglenida; Euglenales; Euglena. 

OX NCBI_TaxI D=3 03 9 ; 

RN [1] 

RP SEQUENCE PROM N.A. 

RC STRAIN=Z / UTEX 753; 

RX MEDLINE=93347989; PubMed-8346031 ; 

RA Hallick R.B., Hong L. , Drager R.G., Favreau M.R. , Monfort A., 

RA Orsat B,, Spielmann A., Stutz E. ; 

RT "Complete sequence of Euglena gracilis chloroplast DNA."; 

RL Nucleic Acids Res. 21:3537-3544(1993). 

CC -i- SIMILARITY: BELONGS TO THE PSAM FAMILY. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; X70810; CAA50083.1; -. 

DR PIR; S34504; S34504 . 

KW Photosystem I; Photosynthesis; Chloroplast. 

SQ SEQUENCE 31 AA; 3443 MW; 85F208627C873165 CRC64 ; 



Query Match 14.3%; Score 4; DB 1; Length 31; 

Best Local Similarity 100.0%; Pred. No. 5.1e+02; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 5 LGKS 8 

MM 

Db 25 LGKS 28 



RESULT 3 
DEF7_RABIT 

ID DEF7_RABIT STANDARD; PRT; 34 AA. 

AC P80223; P80224; 

DT Ol-OCT-1993 (Rel. 27, Created) 

DT Ol-OCT-1993 (Rel. 27, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Corticostatin VI (CS-VI) (Neutrophil antibiotic peptide NP-6) . 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 



ox NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE . 

RC STRAIN=Japanese white; TISSUE^Spleen; 

RX MEDLINE=93387318; PubMed=8397087 ; 

RA Fuse N., Hayashi Y. , Fukata J., Tominaga T. , Ebisui 0., Satoh Y. , 

RA Isohara T. , Uno I . , Imura H. ; 

RT "Purification and characterization of new ant i-adrenocorticotropin 

RT rabbit neutrophil peptides (def ensins) . " ; 

RL Eur, J. Biochem. 216:653-659(1993). 

CC -!- FUNCTION: MICROBICIDAL ACTIVITY AND INHIBITS CORTICOTROPIN (ACTH) 
CC STIMULATED CORTI COSTERONE PRODUCTION. 

CC -1- TISSUE SPECIFICITY: LUNG, SPLEEN, SMALL INTESTINE, PITUITARY 
CC GLAND, ADRENAL MEDULLA AND PLASMA. 

CC -!- SIMILARITY: BELONGS TO THE CORTI COSTATIN/DEFENS IN FAMILY. 

DR PIR; S36843; S36843 . 

DR InterPro; IPR006081; Def ensin_alpha . 

DR InterPro; IPR006080; Def ensin__mammal . 

DR Pfam; PF00323; defensins; 1. 

DR SMART; SM00048; DEFSN; 1. 

DR PROSITE; PS00269; DEFENSIN; 1. 

KW Defensin; Antibiotic. 

FT DISULFID 3 31 

FT DISULFID 5 20 

FT DISULFID 10 30 

SQ SEQUENCE 34 AA; 4052 MW; 



BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
E0090709FA1512B9 CRC64 ; 



Query Match 14.3%; Score 4; DB 1; Length 34; 

Best Local Similarity 100.0%; Pred. No. 5.5e+02; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 
Db 



13 RRRF 16 

MM 

6 RRRF 9 



RESULT 4 
PAHO_MACMU 

ID PAHO_MACMU STANDARD; PRT; 3 6 AA. 

AC P33684; 

DT Ol-FEB-1994 (Rel. 28, Created) 

DT Ol-FEB-1994 (Rel. 28, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Pancreatic hormone (Pancreatic polypeptide) (PP) . 

GN PPY. 

OS Macaca mulatta (Rhesus macaque) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9544 ; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=91164506; PubMed=2 003 150 ; 

RA Yu J . , Xin Y . , Eng J . , Yalow R . S . ; 

RT "Rhesus monkey gastroenteropancreatic hormones: relationship to human 

RT sequences . " ; 

RL Regul, Pept . 32:39-45(1991). 



CC -I- FUNCTION: PANCREATIC HORMONE IS SYNTHESIZED IN PANCREATIC ISLETS 

CC OF LANGERHANS AND ACTS AS A REGULATOR OF PANCREATIC AND 

CC GASTROINTESTINAL FUNCTIONS. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the NPY family. 

DR PIR; C60071; C60071. 

DR HSSP; P013 02; IBBA. 

DR InterPro; IPR001955; Pancreatic__hormn . 

DR Pfam; PF00159; hormone 3 ; 1. 

DR PRINTS; PR00278; PANCHORMONE . 

DR SMART; SM00309; PAH; 1. 

DR PROSITE; PS00265; PANCREATIC_H0RM0NE_1 ; 1. 

DR PROSITE; PS50276; PANCREATIC_H0RM0NE_2 ; 1. 

KW Hormone; Pancreas; Amidation. 

FT M0D_RES 36 3 6 AMIDATION. 

SQ SEQUENCE 36 AA; 4183 MW; 761B717671A7758F CRC64; 

Query Match 14.3%; Score 4; DB 1; Length 36; 

Best Local Similarity 100.0%; Pred. No. 5.8e+02; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 11 DLRR 14 

MM 

Db 23 DLRR 26 



RESULT 5 
PAHO_RABIT 

ID PAHO_RABIT STANDARD; PRT; 36 AA. 

AC P41336; 

DT Ol-FEB-1995 (Rel. 31, Created) 

DT Ol-FEB-1995 (Rel. 31, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Pancreatic hormone (Pancreatic polypeptide) (PP) . 

GN PPY. 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE . 

RC TISSUE=Pancreas ; 

RX MEDLINE=94130533; PubMed=82 993 50 ; 

RA Marks N.J., Shaw C. , Halton D.W., Curry W.J., Thim L. ; 

RT "Rabbit pancreatic polypeptide."; 

RL Comp. Biochem. Physiol. 106B : 883-887 ( 1993) . 

CC -!- FUNCTION: PANCREATIC HORMONE IS SYNTHESIZED IN PANCREATIC ISLETS 

CC OF LANGERHANS AND ACTS AS A REGULATOR OF PANCREATIC AND 

CC GASTROINTESTINAL FUNCTIONS. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC SIMILARITY: Belongs to the NPY family. 

DR HSSP; P013 02; IBBA. 

DR InterPro; IPR001955; Pancreatic_hormn . 

DR Pfam; PF00159; hormone3 ; 1. 

DR PRINTS; PRO 02 78; PANCHORMONE. 

DR SMART; SM00309; PAH; 1. 

DR PROSITE; PS00265; PANCREATIC_H0RM0NE_1 ; 1. 



DR PROSITE; PS50276; PANCREATIC_HORiyiONE_2 ; 1. 

KW Hormone; Amidation; Pancreas. 

FT MOD_RES 36 36 AMIDATION. 

SQ SEQUENCE 36 AA; 4197 MW; A14A4E0831A7759D CRC64; 

Query Match 14.3%; Score 4; DB 1; Length 36; 

Best Local Similarity 100.0%; Pred. No, 5.8e+02; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 11 DLRR 14 

MM 

Db 23 DLRR 26 



RESULT 6 
Y4KD_BPCHP 

ID Y4KD_BPCHP STANDARD; PRT; 3 6 AA. 

AC P19188; 

DT Ol-NOV-1990 (Rel . 16, Created) 

DT Ol-NOV-1990 (Rel. 16, Last sequence update) 

DT Ol-NOV-1990 (Rel. 16, Last annotation update) 

DE Hypothetical 4.7 kDa protein (0RF8) . 

OS Bacteriophage Chpl . 

OC Viruses; ssDNA viruses; Microviridae; Microvirus . 

OX NCBI_TaxID=12367; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90111716; PubMed=2 60734 1 ; 

RA Storey C.C., Lusher M., Richmond S.J.; 

RT "Analysis of the complete nucleotide sequence of Chpl, a phage which 

RT infects avian Chlamydia psittaci."; 

RL J. Gen. Virol. 70:3381-3390(1989). 

cc 

CC This SWISS-PROT entry is copyright- It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; D00624; BAA00510.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 36 AA; 4670 MW; 98AF35E45B9545A4 CRC64 ; 



Query Match 14.3%; Score 4; DB 1; Length 36; 

Best Local Similarity 100.0%; Pred. No. 5.8e+02; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 12 LRRR 15 

MM 

Db 7 LRRR 10 



RESULT 7 
FIBA_CANFA 

ID FIBA_CANFA STANDARD; PRT; 28 AA. 



AC P02673; P14464; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT Ol-NOV-1990 (Rel. 16, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Fibrinogen alpha chain [Contains: Fibrinopept ide A] (Fragment). 

GN FGA. 

OS Canis familiaris (Dog) , and 

OS Vulpes vulpes (Red fox) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615, 9627; 

RN [1] 

RP SEQUENCE . 

RC SPECIES=C. familiaris; 

RX MEDLINE=76081726; PubMed=1198547 ; 

RA Birken S., Wilner G.D., Canfield R.E.; 

RT "Studies of the structure of canine fibrinogen."; 

RL Thromb. Res. 7:599-610(1975). 

RN [2] 

RP SEQUENCE OF 1-16. 

RC SPECIES=C. familiaris, and V. vulpes; 

RA Blomback B., Blomback M., Grondahl N.J.; 

RT "Studies on fibrinopept ides from mammals."; 

RL Acta Chem. Scand. 19:1789-1791(1965). 

RN [3] 

RP SEQUENCE OF 1-16. 

RC SPECIES=C. familiaris; 

RX MEDLINE=66020594; PubMed-5836555 ; 

RA Osbahr A.J. Jr., Colman R.W. , Laki K. , Gladner J.A. ; 

RT "The nature of the peptides released from canine fibrinogen."; 

RL Biochem. Biophys. Res. Commun. 14:555-558(1964). 

CC -!- FUNCTION: FIBRINOGEN HAS A DOUBLE FUNCTION: YIELDING MONOMERS THAT 
CC POLYMERIZE INTO FIBRIN AND ACTING AS A COFACTOR IN PLATELET 

CC AGGREGATION. 

CC -!- SUBUNIT: HEXAMER CONTAINING 2 SETS OF 3 NONIDENTICAL CHAINS 

CC (ALPHA, BETA AND GAMMA) , LINKED TO EACH OTHER BY DISULFIDE BONDS. 

CC -!- MISCELLANEOUS: CONVERSION OF FIBRINOGEN TO FIBRIN IS TRIGGERED BY 

CC THROMBIN, WHICH CLEAVES FIBRINOPEPTIDES A AND B FROM ALPHA & BETA 

CC CHAINS, AND THUS EXPOSES THE N-TERMINAL POLYMERIZATION SITES 

CC RESPONSIBLE FOR THE FORMATION OF THE SOFT CLOT. 

DR PIR; A94308; A05296 . 



KW 


Blood coagulation; 


Plasma ; 


Phosphorylation . 


FT 


PEPTIDE 


1 


16 


FIBRINOPEPTIDE A, 


FT 


MOD_RES 


3 


3 


PHOSPHORYLATION (PARTIAL 


FT 


CONFLICT 


2 


2 


N -> D (IN REF. 2) . 


FT 


CONFLICT 


4 


7 


KEGE -> EGKQ (IN REF. 2) 


FT 


NON_TER 


28 


28 




SQ 


SEQUENCE 


28 AA; 


2958 MW; 


09DCD3F923BFEBD2 CRC64; 



Query Match 10.7%; Score 3; DB 1; Length 28; 

Best Local Similarity 100.0%; Pred. No. 5.2e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 22 lAE 24 

Ml 

Db 9 lAE 11 



RESULT 8 
GTS 5 CHICK 



ID GTS5_CHICK STANDARD; PRT; 28 AA, 

AC P20137; 

DT Ol-FEB-1991 (Rel . 17, Created) 

DT Ol-FEB-1991 (Rel. 17, Last sequence update) 

DT Ol-FEB-1996 (Rel. 33, Last annotation update) 

DE Glutathione S-transf erase 5 (EC 2.5.1.18) (GST-CL5) (GST class-sigma) 

DE (Fragment) . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes ; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE. 

RC TISSUE=Liver; 

RX MEDLINE=90248419; PubMed=23375 94 ; 

RA Chang L.-H., Chuang L.-F., Tsai C.-P., Tu C.-P.D., Tarn M.F.; 

RT "Characterization of glutathione S-transf erases from day-old chick 

RT livers."; 

RL Biochemistry 29:744-750(1990). 

CC -!- FUNCTION: CONJUGATION OF REDUCED GLUTATHIONE TO A WIDE NUMBER 

CC OF EXOGENOUS AND ENDOGENOUS HYDROPHOBIC ELECTROPHILES . 

CC -I- CATALYTIC ACTIVITY: RX + glutathione = HX + R-S-glutathione , 

CC -!- SUBUNIT: Homodimer , 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: BELONGS TO THE GST SUPERFAMILY. SIGMA FAMILY. 

DR FIR; C33948; C33948 . 

DR InterPro; IPR004045; GST_Nterm. 

DR Pfam; PF02798; GST_N; 1. 

KW Transferase. 

FT NON_TER 28 28 

SQ SEQUENCE 28 AA; 338 9 MW; DA51DC0ABD203B12 CRC64; 



Query Match 10.7%; Score 3; DB 1; Length 28; 

Best Local Similarity 100.0%; Pred. No. 5.2e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 23 AEI 25 

Ml 

Db 14 AEI 16 



RESULT 9 
HSP4_0CTVU 

ID HSP4_0CTVU STANDARD; PRT; 28 AA. 

AC P83216; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Sperm protamine P4 (Po4) . 

OS Octopus vulgaris (Octopus) . 

OC Eukaryota; Metazoa; Mollusca; Cephalopoda; Coleoidea; Neocoleoidea; 
OC Octopodi formes; Octopoda; Incirrata; Octopodidae; Octopus. 
OX NCBI TaxID=6645; 



RN 
RP 
RC 
RA 
RA 
RT 
RT 
RL 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 
FT 
FT 
SQ 



[1] 

SEQUENCE, FUNCTION, AND MASS SPECTROMETRY. 
TISSUE=Sperm; 

Gimenez-Bonafe P., Ribes E., Buesa Sautiere P., Kouach M. , 

Ausio J., Kasinsky H.E., Chiva M.; 

"Chromatin remodelling and protamines during spermiogenesis of Octopus 
vulgaris (Cephalopoda) . " ; 
J. Exp. Zool. 0:0-0(2001). 

-!- FUNCTION: Protamines substitute for histones in the chromatin of 
sperm during the haploid phase of spermatogenesis. They compact 
sperm DNA into a highly condensed, stable and inactive 
complex . 

SUBCELLULAR LOCATION: Nuclear. 
TISSUE SPECIFICITY: Testis. 

MASS SPECTROMETRY: MW=353 7; METHOD=Electrospray . 
GO; GO: 0005718; C : nucleosome ; NAS . 
GO; GO: 0005634; C:nucleuS; NAS. 
GO; GO: 0003677; F : DNA binding activity; NAS. 

GO; GO:0007001; P : chromosome organization and biogenesis (sen. . .; NAS. 

GO; GO: 0007076; P: mitotic chromosome condensation; NAS. 

GO; GO: 0006334; Pmucleosome assembly; NAS. 

GO; GO: 0007283; P : spermatogenesis ; NAS. 

Chromosomal protein; Nucleosome core; Spermatogenesis; 

DNA-binding; DNA condensation; Nuclear protein. 

DOMAIN 1 7 POLY-ARG. 

DOMAIN 18 26 POLY-ARG. 

SEQUENCE 28 AA; 3538 MW; A4 0B4D2C1B8E2 OED CRC64 ; 



Query Match 10.7%; 
Best Local Similarity 100.0% 
Matches 3; Conservative 



Score 3; DB 1; Length 28; 
Pred. No- 5.2e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 

Db 



13 RRR 15 

Ml 

1 RRR 3 



RESULT 10 
PA23_TRIST 

ID PA23_TRIST STANDARD; PRT; 28 AA. 

AC P82894; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Phospholipase A2, basic 3 (EC 3.1.1.4) (PA2-III) (PLA2-III) 

DE (Phosphatidylcholine 2-acylhydrolase) (Fragment) . 

OS Trimeresurus stejnegeri (Chinese green tree viper) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Lepidosauria; Squamata; Scleroglossa ; Serpentes; Colubroidea; 

OC Viperidae; Crotalinae; Trimeresurus. 

OX NCBI_TaxID=39682; 

RN [1] 

RP SEQUENCE . 

RC TISSUE^ Venom; 

RA Li S.Y-, Wang W.Y., Xiong Y.L. ; 

RT "Isolation, sequence and characterization of five variants of 

RT phospholipase A2 from venom of snake Trimeresurus stejnegeri."; 



RL Sijbmitted (DEC-2000) to the SWISS-PROT data bank. 

CC FUNCTION: PA2 catalyzes the calcium-dependent hydrolysis of the 2- 

CC acyl groups in 3-sn-phosphoglycerides . Hemolytic and neurotoxic 

CC activities are not detected. 

CC -1- CATALYTIC ACTIVITY: Phosphatidylcholine + H(2)0 = 1- 

CC acylglycerophosphocholine + a fatty acid anion. 

CC -I- COFACTOR: Binds 1 calcium ion per subunit {By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: BELONGS TO THE PHOSPHOLI PASE A2 FAMILY. GROUP II 

CC SUBFAMILY. 

DR HSSP; P82287; IQLL. 

DR InterPro; IPR001211; PhospholipaseA2 . 

DR Pfam; PF00068; phoslip; 1, 

DR ProDom; PD000303; PhospholipaseA2 ; 1. 

DR PROSITE; PS00119; PA2_ASP; PARTIAL. 

DR PROSITE; PS00118; PA2_HIS; PARTIAL. 

KW Hydrolase; Lipid degradation; Calcium; Multigene family. 

FT NON_TER 28 28 

SQ SEQUENCE 28 AA; 3023 MW; 042104521CA1F103 CRC64 ; 



Query Match 10.7%; 
Best Local Similarity 100.0% 
Matches 3; Conservative 



Score 3; DB 1; Length 28; 
Pred. No. 5-2e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 

Db 



5 LGK 7 

Ml 
5 LGK 7 



RESULT 11 
VG9_SPV4 

ID VG9_SPV4 STANDARD; PRT; 28 AA. 

AC P11341; 

DT Ol-JUL-1989 (Rel. 11, Created) 

DT Ol-JUL-1989 (Rel. 11, Last sequence update) 

DT Ol-JUL-1989 (Rel. 11, Last annotation update) 

DE Gene 9 protein. 

GN 9. 

OS Spiroplasma virus 4 (SpV4) . 

OC Viruses; ssDNA viruses; Microviridae ; Spiromicrovirus . 

OX NCBI_TaxID=10855 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=88032809; PubMed=2822658 ; 

RA Renaudin J., Pascarel M.-C, Bove J.-M.; 

RT "Spiroplasma virus 4: nucleotide sequence of the viral DNA, 

RT regulatory signals, and proposed genome organization."; 

RL J. Bacterid. 169:4950-4961(1987). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 



DR EMBL; M17988; -; NOT_ANNOTATED_CDS . 
DR PIR; B29825; G9BPSV. 

SQ SEQUENCE 28 AA; 3776 MW; 9916C3C9C3B9FD1D CRC64 ; 

Query Match 10,7%; Score 3; DB 1; Length 28; 

Best Local Similarity 100.0%; Fred. No. 5.2e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 13 RRR 15 

IN 

Db 17 RRR 19 



RESULT 12 
VI03_VACCP 

ID VI03_VACCP STANDARD; PRT; 28 AA. 

AC Q00334; 

DT Ol-APR-1993 (Rel. 25, Created) 

DT Ol-APR-1993 (Rel. 25, Last sequence update) 

DT Ol-FEB-1994 (Rel. 28, Last annotation update) 

DE Protein 13 (Fragment) . 

GN I3L, 

OS Vaccinia virus (strain L-IVP) . 

OC Viruses; dsDNA viruses, no RNA stage; Poxviridae; Chordopoxvirinae; 

OC Orthopoxvirus . 

OX NCBI_TaxID=31531; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91066899; PubMed=22 50685 ; 

RA Ryazankina O.I., Shchelkunov S.N., Muravlev A.I., Netesova N.A. , 

RA Mikryukov N.N., Gutorov V.V., Nikulin A.E., Kulichkov V.A. , 

RA Malygin E.G. ; 

RT "Molecular-biological study of vaccinia virus genome. II. 

RT Localization and nucleotide sequence of vaccinia virus genes coding 

RT for proteins 36K and 12K."; 

RL Mol. Biol. (Mosk) 24:968-976(1990). 

CC -!- MISCELLANEOUS: THIS PROTEIN IS SYNTHESIZED IN THE EARLY AS WELL AS 
CC THE LATE PHASE OF INFECTION. 

CC 7" 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; X61165; CAA43473.1; -. 

DR InterPro; IPR006754; Pox_I3. 

DR Pfam; PF04661; Pox_I3; 1. 

KW Early protein; Late protein. 

FT NON_TER 1 1 

SQ SEQUENCE 28 AA; 3238 MW; CE108 13AC544F010 CRC64; 



Query Match 10.7%; Score 3; DB 1; Length 28; 

Best Local Similarity 100.0%; Pred. No. 5.2e+03; 
Matches 3; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 0 ; 



Qy 
Db 



4 NLG 6 

III 

5 NLG 7 



RESULT 13 
Y073_ARCFU 

ID Y073_ARCFU STANDARD; PRT; 28 AA. 

AC 030163; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Hypothetical protein AF0073. 

GN AF0073. 

OS Archaeoglobus f ulgidus . 

OC Archaea; Euryarchaeota ; Archaeoglobi ; Archaeoglobales; 

OC Archaeoglobaceae; Archaeoglobus. 

OX NCBI_TaxID=2234 ; 

RN [1] 

RP SEQUENCE FROM N.A, 

RC STRAIN-VC-16 / DSM 4304 / ATCC 49558; 

RX MEDLINE-98049343; PubMed=9389475 ; 

RA Klenk H.-P., Clayton R.A. , Tomb J.-F., White 0., Nelson K.E., 

RA Ketchutn K.A. , Dodson R. J. , Gwinn M. , Hickey E.K., Peterson J.D. , 

RA Richardson D.L., Kerlavage A.R, , Graham D.E., Kyrpides N.C., 

RA Fleischmann R.D., Quackenbush J., Lee N.H., Sutton G.G. , Gill S., 

RA Kirkness E.F., Dougherty B.A., McKenney K. , Adams M.D., Loftus B, , 

RA Peterson S,, Reich C.I., McNeil L.K., Badger J.H. , Glodek A., Zhou L, , 

RA Overbeek R., Gocayne J.D. , Weidman J.F., McDonald L. , Utterback T. , 

RA Cotton M.D., Spriggs T. , Artiach P., Kaine B.P., Sykes S.M., 

RA Sadow P.W., D'Andrea K.P., Bowman C. , Fujii C. , Garland S.A., 

RA Mason T.M., Olsen G.J., Eraser CM., Smith H.O., Woese C.R., 

RA Venter J. C. ; 

RT "The complete genome sequence of the hyperthermophilic, sulphate- 

RT reducing archaeon Archaeoglobus fulgidus . " ; 

RL Nature 390:364-370(1997). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; AEOOllOl; AAB91159.1; -. 

DR PIR; A69259; A69259. 

DR TIGR; AF0 073; 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 28 AA; 3222 MW; 4EA8550DA1345BC9 CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 28; 

Best Local Similarity 100.0%; Pred. No. 5.2e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 23 AEI 25 



Db 



III 

17 AEI 



19 



RESULT 14 
YA79_ARCFU 

ID YA7 9_ARCFU STANDARD; PRT; 28 AA. 

AC 02 9184; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Hypothetical protein AF1079. 

GN API 07 9 - 

OS Archaeoglobus fulgidus. 

OC Archaea; Euryarchaeota ; Archaeoglobi ; Archaeoglobales ; 

OC Archaeoglobaceae; Archaeoglobus. 

OX NCBI_TaxID=2234 ; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=VC-16 / DSM 4304 / ATCC 49558; 

RX MEDLINE=98049343; PubMed=938 9475 ; 

RA Klenk H.-P., Clayton R.A., Tomb J.-F., White 0., Nelson K.E., 

RA Ketchum K.A. , Dodson R.J., Gwinn M., Hickey E.K., Peterson J.D., 

RA Richardson D.L. , Kerlavage A.R., Graham D.E., Kyrpides N.C., 

RA Fleischmann R.D., Quackenbush J., Lee N.H., Sutton G.G., Gill S., 

RA Kirkness E.F., Dougherty B.A., McKenney K. , Adams M.D., Loftus B. , 

RA Peterson S., Reich C.I., McNeil L.K., Badger J.H, , Glodek A. , Zhou L., 

RA Overbeek R., Gocayne J.D., Weidman J.F., McDonald L. , Utterback T., 

RA Cotton M.D., Spriggs T. , Artiach P., Kaine B.P., Sykes S.M., 

RA Sadow P.W., D'Andrea K.P., Bowman C. , Fujii C. , Garland S.A., 

RA Mason T.M., Olsen G.J., Eraser CM., Smith H.O., Woese C.R., 

RA Venter J.C. ; 

RT "The complete genome sequence of the hyperthermophil ic, sulphate- 
RT reducing archaeon Archaeoglobus fulgidus."; 

RL Nature 390:364-370(1997). 

CC SIMILARITY: BELONGS TO THE UPF0165 FAMILY. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinf ormatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AE001028; AAB90160.1; -. 

DR PIR; G69384; G69384. 

DR TIGR; AF1079; -. 

DR InterPro; IPR002807; DUF104. 

DR Pfam; PF01954; DUF104; 1. 

DR ProDom; PD005964; DUF104_N; 1. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 28 AA; 3289 MW; 22502C0A72E6A1D5 CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 28; 

Best Local Similarity 100.0%; Pred. No. 5.2e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 11 DLR 13 

III 

Db 21 DLR 23 



RESULT 15 
12 AH CL0S4 



ID 12AH_CLOS4 STANDARD; PRT; 2 9 AA. 

AC P21215; 

DT Ol-AUG-1991 (Rel . 19, Created) 

DT Ol-AUG-1991 (Rel . 19, Last sequence update) 

DT 15-SEP-2003 (Rel . 42, Last annotation update) 

DE 12-alpha-hydroxysteroid dehydrogenase (EC 1.1.1.176) (Fragment), 

OS Clostridium sp. (strain C 48-50) . 

OC Bacteria; Firmicutes ; Clostridia; Clostridiales ; Clostridiaceae; 

OC Clostridium. 

OX NCB I_TaxI D= 1 507; 

RN [1] 

RP SEQUENCE . 

RX MEDLINE=91177018; PubMed=2 0074 06 ; 

RA Braun M., Luensdorf H. , Bueckmann A.F.; 

RT "12 alpha -hydroxys teroid dehydrogenase from Clostridium group P, 

RT strain C 48-50. Production, purification and characterization,"; 

RL Eur. J. Biochem. 196:439-450(1991). 

CC -!- FUNCTION: Catalyzes the oxidation of the 12 -alpha -hydroxy 1 group 
CC of bile acids, both in their free and conjugated form. Also acts 

CC on bile alcohols. 

CC -!- CATALYTIC ACTIVITY: 3-alpha, 7-alpha, 12-alpha-trihydroxy-5-beta- 
CC cholanate + NADP(+) = 3 -alpha , 7-alpha-dihydroxy-12-oxo-5-beta- 

CC cholanate + NADPH. 

CC -!- SUBUNIT: Homotetramer . 

CC MISCELLANEOUS: The thermostability of the enzyme is greatly 

CC increased due to NADP binding. 

DR PIR; S14099; S14099. 

KW Bile acid catabolism; Oxidoreductase; NADP, 

FT NON_TER 2 9 29 

SQ SEQUENCE 29 AA; 2900 MW; A827DB34DB6C8 8 12 CRC64; 



Query Match 10.7%; Score 3; DB 1; Length 29; 

Best Local Similarity 100,0%; Pred. No. 5.3e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 7 KSI 9 

III 

Db 17 KSI 19 



RESULT 16 
DMS5_PHYSA 

ID DMS5_PHYSA STANDARD; PRT; 2 9 AA, 

AC P80281; 

DT Ol-FEB-1994 (Rel, 28, Created) 

DT Ol-FEB-1994 (Rel. 28, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Dermaseptin 5 (DS V) . 

OS Phyllomedusa sauvagei (Sauvage's leaf frog). 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Neobatrachia ; Bufonoidea; Hylidae; 

OC Phyllomedusinae; Phyllomedusa . 

OX NCBI__TaxID=8395; 

RN [1] 

RP SEQUENCE. 

RC TISSUE=Skin secretion; 

RX MEDLINE=94139686; PubMed=83 0698 1 ; 

RA Mor A. , Nicolas P.; 

RT "Isolation and structure of novel defensive peptides from frog skin." 

RL Eur. J- Biochem. 219:145-154(1994). 

CC -!- FUNCTION: POSSESSES A POTENT ANTIMICROBIAL ACTIVITY AGAINST 

CC BACTERIA FUNGI AND PROTOZOA. PROBABLY ACTS BY DISTURBING MEMBRANE 

CC FUNCTIONS WITH ITS AMPHIPATIC STRUCTURE. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Skin. 

CC -!- SIMILARITY: Belongs to the frog skin active peptide (FSAP) family 
CC Dermaseptin subfamily. 

KW Amphibian defense peptide; Antibiotic; Fungicide; Multigene family. 

SQ SEQUENCE 29 AA; 2840 MW; 540A4971FC5BB506 CRC64; 

Query Match 10.7%; Score 3; DB 1; Length 29; 

Best Local Similarity 100.0%; Pred. No. 5.3e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 6 GKS 8 

III 

Db 10 GKS 12 



RESULT 17 
P2SM_L0XIN 

ID P2SM_L0XIN STANDARD; PRT; 2 9 AA. 

AC P83046; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Sphingomyelinase P2 (EC 3.1.4.12) (Fragment). 

OS Loxosceles intermedia (Spider) . 

OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Araneae; 

OC Araneomorphae; Haplogynae; Sicariidae; Loxosceles. 

OX NCBI_TaxID=58218 ; 

RN [1] 

RP SEQUENCE, FUNCTION, CATALYTIC ACTIVITY, COFACTOR, SUBCELLULAR 

RP LOCATION, AND TISSUE SPECIFICITY. 

RC TISSUE=Venom; 

RX MEDLINE-99009277; PubMed=9790962 ; 

RA Tambourgi D.V., Magnoli F.C., van den Berg C.W. , Morgan B.P., 

RA de Araujo P.S., Alves E.W., Da Silva W.D.; 

RT "Sphingomyelinases in the venom of the spider Loxosceles intermedia 

RT are responsible for both dermonecrosis and complement -dependent 

RT hemolysis . " ; 

RL Biochem. Biophys . Res. Commun. 251:366-373(1998). 

CC -!- FUNCTION: Has Sphingomyelinase activity. Induces complement- 

CC dependent hemolysis and dermonecrosis. 

CC -!- CATALYTIC ACTIVITY: Sphingomyelin + H(2)0 = N-acylsphingosine + 
CC choline phosphate. 



CC COFACTOR: Calcium ion. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -I- TISSUE SPECIFICITY: Expressed by the venom gland. 

KW Hydrolase; Toxin; Calcium; Hemolysis. 

FT NON_TER 2 9 29 

SQ SEQUENCE 29 AA; 3281 MW; 4488EDD619BD23 98 CRC64 ; 



Query Match 10.7%; 
Best Local Similarity 100.0% 
Matches 3; Conservative 



Score 3; DB 1; Length 29; 
■ Pred. No. 5.3e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



Qy 

Db 



4 NLG 6 
111 

25 NLG 27 



RESULT 18 
PR01__DACGL 

ID PR01_DACGL STANDARD; PRT; 2 9 AA. 

AC P18689; 

DT Ol-NOV-1990 (Rel . 16, Created) 

DT Ol-NOV-1990 (Rel. 16, Last sequence update) 

DT Ol-NOV-1997 (Rel. 35, Last annotation update) 

DE Prolamin alpha -1 (Fragment) . 

OS Dactyl is glomerata (Orchard grass) (Cocksfoot grass) . 

OC Eukaryota; Viridiplantae; Streptophyta ; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; Pooideae; 

OC Poeae; Dactylis. 

OX NCBI_TaxID=4 509; 

RN [1] 

RP SEQUENCE. 

RA Vvedenskaya I.O., Shlyapnikov S.V., Konarev A.V. ; 

RT "Characterization of the N- terminal amino acid sequence of alpha- 

RT prolamine from Dactylis glomerata L."; 

RL Biokhimiia 51:1519-1522(1986). 

DR PIR; S02200; S02200- 

FT NON_TER 29 2 9 

SQ SEQUENCE 29 AA; 3647 MW; DEAEB9ADC967D256 CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 29; 

Best Local Similarity 100,0%; Pred. No. 5.3e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 16 FFL 18 

III 

Db 21 FFL 23 



RESULT 19 
CH60_CLOPA 

ID CH60_CLOPA STANDARD; PRT; 3 0 AA. 

AC P81339; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE 60 kDa chaperonin (Protein Cpn60) (groEL protein) (CP 3) (Fragment) . 
GN GROL OR GROEL OR MOPA. 



OS Clostridium pasteurianum. 

OC Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae; 

OC Clostridium. 

OX NCBI_TaxID=1501; 

RN [1] 

RP SEQUENCE . 

RC STRAIN-W5; 

RX MEDLINE=98291870; PubMed=9629918 ; 

RA Flengsrud R., Skjeldal L. ; 

RT "Two-dimensional gel electrophoresis separation and N-terminal 

RT sequence analysis of proteins from Clostridium pasteurianum W5 . " ; 

RL Electrophoresis 19:802-806(1998). 

CC -!- FUNCTION: Prevents misfolding and promotes the refolding and 
CC proper assembly of unfolded polypeptides generated under stress 

CC conditions (By similarity) . 

CC -!- SUBUNIT: Oligomer of 14 subunits composed of two stacked rings of 

CC 7 subunits (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic, 

CC -!- SIMILARITY: Belongs to the chaperonin (HSP60) family. 

DR HAMAP; MF_00600; -; 1. 

DR InterPro; IPR001844; Chaprnin__Cpn60 . 

DR PROSITE; PS00296; CHAPERONINS_CPN60 ; PARTIAL. 

KW Chaperone; ATP-binding. 

FT NON_TER 3 0 3 0 

SQ SEQUENCE 30 AA; 3124 MW; 193F5EF05825BF1D CRC64 ; 



Query Match 10.7%, 
Best Local Similarity 100.0^ 
Matches 3; Conservative 



Score 3; DB 1; Length 30; 
Pred. No. 5.5e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 

Db 



7 KSI 9 

III 
2 KSI 4 



RESULT 2 0 
HSP5_0CTVU 

ID HSP5_0CTVU STANDARD; PRT; 3 0 AA. 

AC P83217; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Sperm protamine P5 (Po5) . 

OS Octopus vulgaris (Octopus) . 

OC Eukaryota; Metazoa; Mollusca; Cephalopoda; Coleoidea; Neocoleoidea; 

OC Octopodi formes; Octopoda; Incirrata; Octopodidae; Octopus. 

OX NCBI__TaxID=6645 ; 

RN [1] 

RP SEQUENCE, FUNCTION, AND MASS SPECTROMETRY. 

RC TISSUE=Sperm; 

RA Gimenez-Bonafe P., Ribes E., Buesa C. , Sautiere P., Kouach M. , 

RA Ausio J., Kasinsky H.E., Chiva M. ; 

RT "Chromatin remodelling and protamines during spermiogenesis of Octopus 

RT vulgaris (Cephalopoda)."; 

RL J. Exp. Zool. 0:0-0(2001). 

CC -!- FUNCTION: Protamines substitute for histones in the chromatin of 
CC sperm during the haploid phase of spermatogenesis. They compact 



983 
984 
985 
986 
987 
988 
989 
990 
991 
992 
993 
994 
995 
996 
997 
998 
999 
1000 



2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 



7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 



1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 



30 
30 
30 
30 
30 
30 
30 
30 
30 
30 
30 
30 
30 
30 
30 
30 
30 
30 



2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 



B45095 
A22977 
S08565 
B61278 
S30757 
S30760 
S30759 
S13753 
S26175 
S68312 
S42364 
S05223 
S30333 
S28991 
PC2307 
PQ0484 
C43591 
B56586 



photosystem I ligh 
delta -endotoxin - 
ribulose-bisphosph 
coat protein VPl - 
genome polyprotein 
genome polyprotein 
genome polyprotein 
replication initia 
tail tubular prote 
glucuronosyl trans f 
aromatic -amino -aci 
photosystem I 6 . 5K 
N-carbamoyl-D-amin 
antifungal protein 
X-Pro aminopeptida 
globulin lb - taro 
5 IK outer membrane 
storage hexamer 2 



ALIGNMENTS 



RESULT 1 
A05012 

hypothetical protein 30 - liverwort (Marchantia polymorpha) chloroplast 
C; Species: chloroplast Marchantia polymorpha 

C;Date: 05-Jun-1987 #sequence_revision 05-Jun-1987 #text_change 21-Jul-2000 
C;Accession: S01582; A05012 

R;Umesono, K. ; Inokuchi, H. ; Shiki, Y. ; Takeuchi , M.; Chang, Z.; Fukuzawa, H. ; 
Kohchi, T.; Shirai, H.; Ohyama, K. ; Ozeki, H. 
J. Mol. Biol. 203, 299-331, 1988 

A; Title: Structure and organization of Marchantia polymorpha chloroplast genome. 
II. Gene organization of the large single copy region from rps ' 12 to atpB. 
A;Reference number: S01567; MUID : 89068686 ; PMID:2974085 
A;Accession: S01582 
A; Molecule type: DNA 
A; Residues: 1-30 <UME> 

A; Cross-references: EMBL:X04465; NID:gll640/ PIDN : CAA28 070 . 1 ; PID:gll657 
A;Note: the authors translated the codon GAA for residue 2 as Val and ATT for 
residues 13 and 14 as Asn 

R;Ohyama, K. ; Fukuzawa, H.; Kohchi, T.; Shirai, H, ; Sano, T. ; Sano, S.; Umesono, 
K. ; Shiki, Y.; Takeuchi , M. ; Chang, Z.; Acta, S.; Inokuchi, H.; Ozeki, H. 
Nature 322, 572-574, 1986 

A; Title: Chloroplast gene organization deduced from complete sequence of 
liverwort Marchantia polymorpha chloroplast DNA. 
A; Reference number: A3 8 014 

A; Contents: annotation; gene organization, sites, features 

C; Genetics : 

A; Genome: chloroplast 

C; Keywords : chloroplast 

Query Match 14.3%; Score 4; DB 2; Length 30; 

Best Local Similarity 100.0%; Fred. No. l.le+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 15 RFFL 18 



INI 

Db 24 RFFL 27 



RESULT 2 
S34504 

photosystem I protein psaM - Euglena gracilis chloroplast 
C; Species: chloroplast Euglena gracilis 

C;Date: lO-Sep-1999 #sequence_revision lO-Sep-1999 #text_change 29-Oct-1999 
C;Accession: S34504; S34871 

R;Hallick, R.B.; Hong, L. ; Drager, R.G.; Favreau, M.; Monfort, A.; Orsat, B.; 
Spielmann, A.; Stutz, E. 

submitted to the EMBL Data Library, January 1993 

A;Description: The complete sequence of the Euglena gracilis chloroplast genome 
(tentative) . 

A; Reference number: S34494 
A;Accession: S34504 
A; Molecule type: DNA 
A;ResidueS: 1-31 <HAL1> 

A; Cross-references: EMBL:X70810; NID:g415327; PIDN : CAA50083 . 1 ; PID:g415739 
R;Hallick, R.B.; Hong, L. ; Drager, R.G. ; Favreau, M.R. ; Monfort, A.; Orsat, B.; 
Spielmann, A.; Stutz, E. 
Nucleic Acids Res. 21, 3537-3544, 1993 

A;Title: Complete sequence of Euglena gracilis chloroplast DNA, 
A;Reference number: S34862; MUID : 93347989 ; PMID: 8346031 
A; Accession: S34871 

A; Status: nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A /Residues: 1-31 <HAL2> 

A; Cross -references : EMBL : X70810 ; NID:g415327; PIDN : CAA50083 . 1 ; PID:g415739 
A;Note: the nucleotide sequence was submitted to the EMBL Data Library, January 
1993 

C; Genetics : 

A; Gene: psaM 

A; Genome: chloroplast 

C;Superfamily: Euglena gracilis chloroplast photosystem I protein psaM 
C; Keywords: chloroplast; photosynthesis; photosystem I; thylakoid 

Query Match 14,3%; Score 4; DB 1; Length 31; 

Best Local Similarity 100,0%; Pred. No. l.le+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 5 LGKS 8 

MM 

Db 25 LGKS 28 



RESULT 3 
PQ0416 

RNA-directed RNA polymerase (EC 2.7.7.48) 3 - influenza A virus (strain 
A/Yamagata/120/86 [HlNl] ) (fragment) 
N;Alternate names: P2 protein; PA protein 
C; Species: influenza A virus 

C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 07-May-1999 
C; Access ion: PQ0416 

R;Li, X,S.; Zhao, C.Y.; Gao , H.M.; Zhang, Y.Q. ; Ishida, M.; Kanegae, Y. ; Endo, 
A.; Nerome, R.; Omoe, K. ; Nerome, K. 



J. Gen. Virol. 73, 1329-1337, 1992 

A;Title: Origin and evolutionary characteristics of antigenic reassortant 

influenza A (H1N2) viruses isolated from man in China. 

A;Reference number: PQ0408; MUID: 92300326 ; PMID : 1607856 

A; Access ion: PQ0416 

A;Molecule type: genomic RNA 

A; Residues: 1-33 <LIA> 

C; Genetics : 

A; Map position: segment 3 

C; Super family: influenza virus RNA-directed RNA polymerase 3 
C; Keywords : nucleotidyltransferase 

Query Match 14.3%; Score 4; DB 2; Length 33; 

Best Local Similarity 100.0%; Pred. No. 1.2e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 6 GKSI 9 

MM 

Db 16 GKSI 19 



RESULT 4 
S36843 

defensin NP-6 - rabbit 
N;Contains: defensin des-Gl-NP-6 

C;Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: lO-Dec-1993 #sequence_revision 13-Mar-1997 #text_change 13-Mar-1998 
C;Accession: S36843; S36844 

R;Fuse, N.; Hayashi, Y.; Fukata, J.; Tominaga, T. ; Ebisui, 0.; Satoh, Y.; 
Isohara, T.; Uno, I.; Imura, H. 
Eur. J. Biochem. 216, 653-659, 1993 

A;Title: Purification and characterization of new anti-adrenocorticotropin 
rabbit neutrophil peptides (defensins) . 

A;Reference number: S36843; MUID: 93387318 ; PMID:8397087 

A; Access ion: S3 6843 

A;Molecule type: protein 

A; Residues: 1-34 <FUS> 

A;Accession: S36844 

A; Molecule type: protein 

A;Residues: 2-34 <FU2> 

A;Note: defensin des-Gl-NP-6 

C; Superfamily : mammalian defensin 

Query Match 14.3%; Score 4; DB 2; Length 34; 

Best Local Similarity 100.0%; Pred. No. 1.2e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 13 RRRF 16 

MM 

Db 6 RRRF 9 



RESULT 5 
A39888 

synapsin I - rat (fragments) 

C;Species: Rattus norvegicus (Norway rat) 

C;Date: 31-Jan-1992 #sequence_revision 31-Jan-1992 #text_change 18-Jun-1993 



C;Accession: A39888 

R;Czernik, A.J,; Pang, D.T. ; Greengard, P. 

Proc. Natl. Acad. Sci. U.S.A. 84, 7518-7522, 1987 

A; Title: Amino acid sequences surrounding the cAMP- dependent and 

calcium/calmodul in-dependent phosphorylation sites in rat and bovine synapsin I. 

A;Reference number: A39888; MUID: 88041137 ; PMID:3118371 

A;Accession: A39888 

A; Status: preliminary 

A;Molecule type: protein 

A;Residues: 1-34 <CZE> 

Query Match 14.3%; Score 4; DB 2; Length 34; 

Best Local Similarity 100.0%; Pred. No. 1.2e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 12 LRRR 15 

MM 

Db 3 LRRR 6 



RESULT 6 
S20042 

hypothetical protein 1 - Haemophilus influenzae insertion sequence IS1016(V-5) 
(fragment) 

C; Species: Haemophilus influenzae 

C;Date: 07-May-1998 #sequence_revision 17-Jul-1998 #text_change 17-Jul-1998 
C;Accession: S20042 

R;Kroll, J.S.; Loynds , B.M.; Moxon, E.R. 
Mol. Microbiol. 5, 1549-1560, 1991 

A; Title: The Haemophilus influenzae capsulation gene cluster: a compound 
transposon. 

A;Reference number: S16288; MUID: 92157882 ; PMID: 1664907 
A; Access ion: S20042 

A; Status: preliminary; translation not shown 

A; Molecule type: DNA 

A;Residues: 1-35 <KRO> 

A ; Cro s s - references : EMBL : X5 8 1 7 7 

C; Genetics : 

A;Mobile element: insertion sequence IS1016(V-5) 

Query Match 14.3%; Score 4; DB 2; Length 35; 

Best Local Similarity 100.0%; Pred. No. 1.2e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



7 KSIQ 10 

MM 
10 KSIQ 13 



RESULT 7 
E64108 

protein V6, truncated - Haemophilus influenzae insertion sequence IS1016 
C; Species: Haemophilus influenzae 

C;Date: 18-Aug-1995 #sequence_revision 18-Aug-1995 #text_change 24-Oct-1998 
C; Access ion: E64108 

R;Fleischmann, R.D. ; Adams, M.D.; White, 0.; Clayton, R.A.; Kirkness, E.F.; 
Kerlavage, A.R,; Bult, C.J.; Tomb, J.F.; Dougherty, B.A.; Merrick, J.M. ; 



McKenney, K. ; Sutton, G.; FitzHugh, W. ; Fields, C. ; Gocayne, J.D,; Scott, J. ; 
Shirley, R. ; Liu, L.I,; Glodek, A.; Kelley, J.M.; Weidman, J. F . ; Phillips, C.A. ; 
Spriggs, T. ; Hedblom, E.; Cotton, M.D.; Utterback, T.R. ; Hanna, M.C.; Nguyen, 
D.T.; Saudek, D.M.; Brandon, R.C.; Fine, L.D.; Fritchman, J.L.; Fuhrmann, J,L.; 
Geoghagen , N . S . M . 
Science 269, 496-512, 1995 

A;Authors: Gnehm, C.L.; McDonald, L.A.; Small, K.V. ; Fraser, C.M.; Smith, H.O. ; 
Venter, J.C. 

A; Title: Whole-genome random sequencing and assembly of Haemophilus influenzae 
Rd, 

A;Reference number: A64000; MUID : 95350630; PMID:7542800 
A; Access ion: E64108 

A; Status: nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A; Residues: 1-35 <TIGR> 

A; Cross-references : GB:U32782; GB:L42023; NID : gl574041 ; FID : gl574052 ; 
TIGR:HI1018 

A; Experimental source: strain Rd KW2 0 
C; Genetics: 

A; Mobile element: insertion sequence IS1016 

Query Match 14.3%; Score 4; DB 2; Length 35; 

Best Local Similarity 100.0%; Pred. No. 1.2e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 7 KSIQ 10 

MM 

Db 10 KSIQ 13 



RESULT 8 
D82607 

hypothetical protein XF2026 [imported] - Xylella fastidiosa (strain 9a5c) 
C; Species: Xylella fastidiosa 

C;Date: 18-Aug-2000 #sequence_revision 20-Aug-2000 #text_change 20-Aug-2000 
C; Accession: D82607 

R;anonymous, The Xylella fastidiosa Consortium of the Organization for 
Nucleotide Sequencing and Analysis, Sao Paulo, Brazil. 
Nature 406, 151-157, 2000 

A; Title: The genome sequence of the plant pathogen Xylella fastidiosa. 

A;Reference number: A82515; MUID: 20365717; PMID : 10910347 

A;Note: for a complete list of authors see reference number A59328 below 

A;Accession: D82607 

A ; status : prel iminary 

A; Molecule type: DNA 

A; Residues: 1-35 <SIM> 

A; Cross -references: GB:AE004021; GB:AE003849; NID : g9107139 ; PIDN : AAF84B28 . 1 ; 

GSPDB:GN00128; XFSC:XF2026 

A; Experimental source: strain 9a5c 

R;Simpson, A.J.G.; Reinach, F.C.; Arruda, P.; Abreu, F.A.; Acencio, M. ; 
Alvarenga, R. ; Alves, L.M.C.; Araya, J.E.; Baia, G.S.; Baptista, C.S.; Barros , 
M.H.; Bonaccorsi, E.D.; Bordin, S.; Bove, J.M.; Briones , M.R.S.; Bueno, M.R.P.; 
Camargo , A . A . ; Camargo , L . E . A . ; Carraro , D . M . ; Carrer , H . ; Colauto , N . B . ; 
Colombo, C; Costa, F.F.; Costa, M.C.R.; Costa-Neto, CM.; Coutinho, L.L.; 
Cristofani, M.; Dias-Neto, E. ; Docena, C; El-Dorry, H.; Facincani, A. P.; 
Ferreira, A.J.S. 
submitted to GenBank, June 2000 



A;Authors: Ferreira, V.C.A.; Ferro, J. A.; Fraga, J.S,; Franca, S.C; Franco, 
M.C.; Frohme, M. ; Furlan, L.R. ; Garnier, M.; Goldman, G.H.; Goldman, M.H.S.; 
Gomes, S.L.; Gruber, A.; Ho, P.L.; Hoheisel, J.D./ Junqueira, M.L,; Kemper, 
E.L.; Kitajima, J, P.; Krieger, J.E.; Kuramae, E.E.; Laigret, F. ; Lambais, M.R.; 
Leite, L.C.C.; Lemos, E.G.M.; Lemos, M.V.F.; Lopes, S.A.; Lopes, C.R.; Machado, 
J. A.; Machado, M.A. ; Madeira, A.M.B.N.; Madeira, H.M.F,,- Marino, C.L.; Marques, 
M.V.; Martins, E.A.L. 

A; Authors: Martins, E.M.F.; Matsukuma, A.Y.; Menck, C.F.M.; Miracca, E.G.; 
Miyaki, C.Y. ; Monteiro-Vitorello , C.B.; Moon, D.H.; Nagai, M.A. ; Nascimento, 

A. L.T.O.; Netto, L.E.S.; Nhani Jr., A.; Nobrega, F.G.; Nunes, L.R.; Oliveira, 
M.A.; de Oliveira, M.C.; de Oliveira, R.C. ; Palmieri, D.A.; Paris, A.; Peixoto, 

B. R.; Pereira, G.A.G.; Pereira Jr., H.A.; Pesquero, J.B,; Quaggio, R.B.; 
Roberto, P.G.; Rodrigues, V.; Rosa, A.J. de M. ; de Rosa Jr., V.E.; de Sa, R.G.; 
Santelli, R.V.; Sawasaki, H.E. 

A;Authors: da Silva, A.C.R.; da Silva, F.R. ; da Silva, A.M.; Silva Jr., W.A. ; da 
Silveira, J.F.; Silvestri, M.L.Z.; Siqueira, W. J, ,- de Sousa, A.A. ; de Souza, 
A. P.; Terenzi, M.F.; Truffi, D. ; Tsai, S.M.; Tsuhako, M.H.; Vallada, H. ; Van 
Sluys, M.A.; Verjovski -Almeida, S.; Vettore, A.L.; Zago, M.A. ; Zatz, M. ; 
Meidanis, J.; Setubal, J.C. 
A; Reference number: A59328 
A; Contents : annotation 
C; Genetics : 
A; Gene: XF2 026 

Query Match 14.3%; Score 4; DB 2; Length 35; 

Best Local Similarity 100.0%; Pred. No. 1.2e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 21 LIAE 24 

MM 

Db 30 LIAE 33 



RESULT 9 
JU0352 

4.6K protein - Chlamydophila psittaci phage Chpl 

C;Species: Chlamydophila psittaci phage Chpl, Chlamydia psittaci phage Chpl 
A;Note: host Chlamydia psittaci 

C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 16-Jun-2000 
C;Accession: JU0352 

R; Storey, C.C.; Lusher, M. ; Richmond, S.J. 
J. Gen. Virol. 70, 3381-3390, 1989 

A;Title: Analysis of the complete nucleotide sequence of Chpl, a phage which 
infects avian Chlamydia psittaci. 

A;Reference number: JU0345; MUID: 90111716 ; PMID: 2607341 
A; Access ion: JU0352 
A; Molecule type: DNA 
A;Residues: 1-36 <ST0> 

A; Cross-references: GB:D00624; NID:g217761; PIDN:BAA00510.1; PID:g217769 
C;Superfamily: Chlamydia psittaci phage Chpl 4.6K protein 

Query Match 14.3%; Score 4; DB 1; Length 36; 

Best Local Similarity 100.0%; Pred. No. 1.3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 



12 LRRR 15 
MM 



Db 



7 LRRR 10 



RESULT 10 
C60071 . 

pancreatic hormone - rhesus macaque 
N;Alternate names: pancreatic polypeptide 
C; Species: Macaca mulatta (rhesus macaque) 

C;Date: 07-May-1993 #sequence_revision 07-May-1993 #text_change 15-Jun-1996 
C;Accession: C60071 

R;Yu, J-; Xin, Y.; Eng, J.; Yalow, R.S. 
Regul- Pept. 32, 39-45, 1991 

A;Title: Rhesus monkey gastroenteropancreatic hormones: relationship to human 
sequences . 

A;Reference number: A60071; MUID : 91164506 ; PMID: 2003150 
A /Access ion: C60071 

A; Status: protein sequence not shown 
A;Molecule type: protein 
A; Residues: 1-36 <YUA> 

A; Note: the sequence is identical with the human sequence 
C; Super family: pancreatic hormone 

C; Keywords: amidated carboxyl end; hormone; pancreas 

F;36/Modif ied site: amidated carboxyl end (Tyr) #status experimental 

Query Match 14.3%; Score 4; DB 2; Length 36; 

Best Local Similarity 100.0%; Fred. No. 1.3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 11 DLRR 14 

MM 

Db 23 DLRR 26 



RESULT 11 
S17507 

cytokine - rabbit 

C;Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: 19-Mar-1997 #sequence__revision 19-Mar-1997 #text_change 31-Oct-1997 
C;Accession: S17507 

R;Jose, P.J.; Collins, P.D.; Perkins, J.A. ; Beaubien, B.C.; Totty, N.F.; 
Waterfield, M.D.; Hsuan, J.; Williams, T.J. 
Biochem, J. 278, 493-497, 1991 

A;Title: Identification of a second neutrophil -chemoattractant cytokine 
generated during an inflammatory reaction in the rabbit peritoneal cavity in 
vivo. Purification, partial amino acid sequence and structural relationship to 
melanoma -growth-st imulatory act ivi ty . 

A;Reference number: S17507; MUID: 91378900 ; PMID: 1898341 

A; Access ion: S17507 

A; Status : preliminary 

A;Molecule type: protein 

A;Residues: 1-36 <JOS> 

C ; Super f ami ly : beta - thromboglobul in 

Query Match 14.3%; Score 4; DB 2; Length 36; 

Best Local Similarity 100.0%; Pred. No. 1.3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 
Db 



7 KSIQ 10 

MM 

19 KSIQ 22 



RESULT 12 
S04627 

glutathione transferase (EC 2.5.1.18) 6.0 - Proteus mirabilis (fragment) 
C; Species: Proteus mirabilis 

C;Date: 28-Feb-1990 #sequence_revision 28-Feb-1990 #text_change Ol-Feb-1999 
C; Accession: S04627 

R;Di Ilio, C; Aceto, A.; Piccolomini, R. ; Allocati, N.; Caccuri , A.M.; Barra, 

D. ; Federici, G. 

FEES Lett. 250, 57-59, 1989 

A;Title: N-terminal region of Proteus mirabilis glutathione transferase is not 

homologous to mammalian and plant glutathione transferases. 

A;Reference number: S04627; MUID : 89290034 ; PMID:2661269 

A;Accession: S04627 

A; Molecule type: protein 

A/Residues : 1-38 <DII> 

C; Complex: dimer 

C; Function: 

A; Description: catalyzes conjugation of glutathione to a large variety of 
electrophilic compounds of endobiotic and xenobiotic origin; also involved in 
intracellular binding and transport of hydrophobic compounds; involved in 
detoxification of organic hydroperoxides 
C; Superfamily : glutathione transferase 
C; Keywords: dimer; transferase 

Query Match 14.3%; Score 4; DB 2; Length 38; 

Best Local Similarity 100.0%; Pred. No, 1.3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 11 DLRR 14 

MM 

Db 31 DLRR 34 



RESULT 13 
B39888 

synapsin I - bovine (fragments) 

C; Species: Bos primigenius taurus (cattle) 

C;Date: 31-Jan-1992 #sequence_revision 31-Jan-1992 #text_change 18-Jun-1993 
C; Access ion: B39888 

R;Czernik, A.J.; Pang, D.T. ; Greengard, P. 

Proc. Natl. Acad. Sci. U.S.A. 84, 7518-7522, 1987 

A; Title: Amino acid sequences surrounding the cAMP -dependent and 

calcium/calmodul in-dependent phosphorylation sites in rat and bovine synapsin I. 

A;Reference number: A39888; MUID : 88041137 ; PMID:3118371 

A; Accession: B39888 

A; Status : preliminary 

A; Molecule type: protein 

A; Residues: 1-38 <CZE> 

Query Match 14.3%; Score 4; DB 2; Length 38; 

Best Local Similarity 100.0%; Pred, No. l,3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 12 LRRR 15 

Db 3 LRRR 6 



RESULT 14 
A38335 

58K tubulointerst itial nephritis antigen - rabbit (fragments) 
C;Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: 28-Jun-1991 #sequence_revision 24-Feb-1994 #text_change 24-Feb-1994 
C;Accession: A38335; B38335 

R;Butkowski, R.J.; Langeveld, J, P.M.; Wieslander, J.; Brentjens, J.R.; Andres, 
G.A. 

J. Biol. Chem. 265, 21091-21098, 1990 

A; Title: Characterization of a tubular basement membrane component reactive with 

autoantibodies associated with tubulointerstitial nephritis. 

A;Reference number: A38335; MUID : 91065920 ; PMID: 2250013 

A /Access ion: A3 8 33 5 

A; Molecule type: protein 

A; Residues: 1-38 <BUT> 

A; Note: the source of this sequence is the 58K antigen 

A /Access ion: B38335 

A; Molecule type: protein 

A;Residues: 1-12 <BU2> 

A; Note: the source of this sequence is the 50K antigen 

Query Match 14.3%; Score 4; DB 2; Length 38; 

Best Local Similarity 100.0%; Fred. No, 1.3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 22 lAEI 25 

MM 

Db 26 lAEI 2 9 



RESULT 15 
E86077 

hypothetical protein Z5430 [imported] - Escherichia coli (strain 0157 :H7, 

substrain EDL933) 

C; Species: Escherichia coli 

C;Date: 16-Feb-2001 #sequence_revision 16-Feb-2001 #text_change 14-Sep-2001 
.C; Accession: E86077 

R;Perna, N.T.; Plunkett III, G.; Burland, V.; Mau, B.; Glasner, J.D. ; Rose, 
D.J.; Mayhew, G.F.; Evans, P.S.; Gregor, J.; Kirkpatrick, H.A. ; Posfai, G. ; 
Hackett, J.; Klink, S.; Boutin, A.; Shao, Y, ; Miller, L. ; Grotbeck, E.J.; Davis, 
N.W.; Lim, A.; Dimalanta, E.; Potamousis, K. ; Apodaca, J.; Anantharaman, T.S.; 
Lin, J.; Yen, G. ; Schwartz, D.C.; Welch, R.A.; Blattner, F.R. 
Nature 409, 529-533, 2001 

A;Title: Genome sequence of enterohemorrhagic Escherichia coli 0157 :H7. 

A; Reference number: A85480; MUID : 21074935 ; PMID : 11206551 

A; Access ion: E86077 

A; Status: preliminary 

A; Molecule type: DNA 

A/Residues : 1-38 <STO> 

A; Cross-references: GB:AE005174; NID:gl2518780 ; PIDN:AAG59081 . 1; GSPDB : GN00145 ; 
UWGP:Z543 0 



A; Experimental source: strain 0157 :H7, substrain EDL933 
C; Genetics : 
A;Gene: Z5430 



Query Match 14.3%; Score 4; DB 2; Length 38; 

Best Local Similarity 100.0%; Pred. No, 1.3e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 2 LHNL 5 

MM 

Db 20 LHNL 23 



RESULT 16 
S67938 

hypothetical protein 3 - Streptococcus pneumoniae 
C;Species: Streptococcus pneumoniae 

C;Date: 05-Dec-1996 #sequence_revision 13-Mar-1997 #text_change 13-Mar-1997 

C; Accession: S67938 

R;Saluja, S.K. ; Weiser, J.N. 

Mol. Microbiol, 16, 215-227, 1995 

A;Title: The genetic basis of colony opacity in Streptococcus pneumoniae: 
evidence for the effect of box elements on the frequency of phenotypic 
variation. 

A;Reference number: S67936; MUID: 96015435 ; PMID: 7565084 

A;Accession: S67938 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-3 9 <SAL> 

A; Cross-references : EMBL:U12567 

Query Match 14.3%; Score 4; DB 2; Length 39; 

Best Local Similarity 100.0%; Pred. No. 1.4e+03; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 5 LGKS 8 

MM 

Db 5 LGKS 8 



RESULT 17 
G9BPSV 

gene 9 protein - spiroplasma virus 4 
C;Species: spiroplasma virus 4, SpV4 
A; Note: host Spiroplasma melliferum 

C;Date: 31-Mar-1989 #sequence_revision 31-Mar-1989 #text_change 07-Dec-1999 
C; Access ion: B2 9825 

R;Renaudin, J.; Pascarel, M.C.; Bove, J.M. 
J. Bacteriol. 169, 4950-4961, 1987 

A;Title: Spiroplasma virus 4: nucleotide sequence of the viral DNA, regulatory 

signals, and proposed genome organization. 

A;Reference number: A91845; MUID: 88032809 ; PMID:2822658 

A;Accession: B29825 

A; Molecule type: DNA 

A;Residues: 1-28 <REN> 

A; Cross-references: GB:M17988; NID:g334998 

A;Note: this ORF is not annotated in GenBank entry SPVDNA 



C;Comment: This virus is a procaryote DNA virus. 
C; Genetics : 
A; Gene: 9 

A; Genetic code: SGC3 

C;Superfamily: spiroplasma virus 4 gene 9 protein 

Query Match 10.7%; Score 3; DB 1; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 13 RRR 15 

Ml 

Db 17 RRR 19 



RESULT 18 
S41774 

ubiquinol-cytochrome-c reductase (EC 1.10.2.2) cytochrome b - Trypanosoma 

congolense mitochondrion (fragment) 

C;Species: mitochondrion Trypanosoma congolense 

C;Date: 25-Dec-1994 #sequence_revision 27-Feb-1997 #text_change 03-Jun-2002 
C; Accession: S41774 

R;Read, L.K.; Fish, W.R.; Muthiani , A.M.; Stuart, K. 
Nucleic Acids Res. 21, 4073-4078, 1993 

A; Title: Maxicircle DNA and edited mRNA sequences of closely related trypanosome 

species: implications of kRNA editing for evolution of maxicircle genomes. 

A;Reference number: S41774; MUID: 93382785 ; PMID: 8396763 

A;Accession: S41774 

A; Status : preliminary 

A; Molecule type: mRNA 

A; Residues: 1-28 <REA> 

C;Superfamily: cytochrome b; cytochrome b homology; cytochrome b6 homology; 
plastoquinol-plastocyanin reductase 17K protein homology 
C;Keywords: electron transfer; mitochondrion; oxidative phosphorylation; 
oxidoreductase; respiratory chain; RNA editing 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



Db 



16 FFL 18 

III 
9 FFL 11 



RESULT 19 
S66436 

allophycocyanin alpha-B chain - Anabaena sp. (strain PCC 7120) (fragment) 
C; Species: Anabaena sp . 
A;Variety: PCC 7120 

C;Date: 15-Feb-1997 #sequence_revision 13-Mar-1997 #text_change 24-Oct-1997 
C;Accession: S66436 

R;Ducret, A.; Sidler, W. ; Wehrli, E.; Frank, G.; Zuber, H. 
Eur. J. Biochem. 236, 1010-1024, 1996 

A; Title: Isolation, characterization and electron microscopy analysis of a 
hemidiscoidal phycobil isome type from the cyanobacterium Anabaena sp. PCC 7120. 
A;Reference number: S66435; MUID : 96270757 ; PMID:8665889 



A/Accession: S66436 
A; Status: preliminary 
A; Molecule type: protein 
A; Residues: 1-28 <DUC> 
C; Superfamily : phycocyanin 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 7 KSI 9 

III 

Db 23 KSI 25 



RESULT 2 0 
C33948 

glutathione transferase (EC 2.5.1.18) 5 - chicken (fragment) 
N;Alternate names: glutathione S-alkyltransf erase; glutathione S- 
aralkyl transferase; glutathione S-aryltransf erase 
C; Species: Callus gallus (chicken) 

C;Date: 07-Jun-1990 #sequence_revision 07-Jun-1990 #text_change 03-Nov-2000 
C;Accession: C33948 

R;Chang, L.H.; Chuang, L.F.; Tsai, CP.; Tu, C.P.D.; Tam, M.F. 
Biochemistry 29, 744-750, 1990 

A; Title: Characterization of glutathione S-transf erases from day-old chick 
livers . 

A;Reference number: A90547; MUID: 90248419 ; PMID:2337594 

A; Accession: C33 94 8 

A; Molecule type: protein 

A;Residues: 1-28 <CH3> 

C ; Superfamily : glutathione transferase 

C; Keywords : transferase 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 23 AEI 25 

III 

Db 14 AEI 16 



RESULT 21 
A31859 

deoxycytidine kinase (EC 2.7,1.74) / deoxyadenosine kinase (EC 2.7.1.76) - 
Lactobacillus acidophilus (fragment) 
C; Species: Lactobacillus acidophilus 

C;Date: 07-Jun-1990 #sequence_revision 07-Jun-1990 #text_change 18-Sep-1998 
C;Accession: A31859 

R;Ikeda, S,; Swenson, R.P.; Ives, D.H. 
Biochemistry 27, 8648-8652, 1988 

A;Title: Amino - terminal nucleot ide-binding sequences of a Lactobacillus 

deoxynucleoside kinase complex isolated by novel affinity chromatography. 

A;Reference number: A31859; MUID: 89118283 ; PMID:2851331 

A; Access ion: A3 185 9 

A; Molecule type: protein 



A;Residues: 1-28 <IKE> 

A;Note: 18-Ile, 21-Lys, 26-Gln, and 27-Ala were also found 
C;Superfamily : Lactobacillus acidophilus deoxyadenosine kinase 
C; Keywords : phosphotransferase 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKS 8 

III 

Db 11 GKS 13 



RESULT 22 
B54257 

deoxynucleoside kinase complex I F-component - Lactobacillus acidophilus 
(fragment) 

N;Alternate names: dCyd kinase subunit 
C;Species: Lactobacillus acidophilus 

C;Date: 06-Oct-1994 #sequence_revision 18-Nov-1994 #text_change 18-Sep-1998 

C;Accession: B54257 

R;Ikeda, S.; Ma, G.T,; Ives, D.H, 

Biochemistry 33, 5328-5334, 1994 

A; Title; Heterodimeric deoxynucleoside kinases of Lactobacillus acidophilus R- 
26: functional assignment of subunits using limited proteolysis controlled by 
end-product inhibitors . 

A;Reference number: A54257; MUID: 94227067; PMID: 8172906 

A;Accession: B54257 

A ; Status : prel iminary 

A;Molecule type: protein 

A;Residues: 1-28 <IKE> 

A; Experimental source: R-2 6 

A;Note: sequence extracted from NCBI backbone (NCBIP : 146749) 
C; Super family: Lactobacillus acidophilus deoxyadenosine kinase 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKS 8 

III 

Db 11 GKS 13 



RESULT 23 
D41912 

T-cell receptor beta chain (AE16) - mouse (fragment) 
C; Species: Mus musculus (house mouse) 

C;Date: 03-Feb-1994 #sequence_revision 03-Feb-1994 #text_change 17-Mar-1999 
C; Accession: D41912 

R;Hong, S.C.; Chelouche, A.; Lin, R.; Shaywitz, D. ; Braunstein, N.S.; Glimcher, 
L . ; Janeway Jr . , C . A . 
Cell 69, 999-1009, 1992 

A;Title: An MHC interaction site maps to the amino-terminal half of the T cell 
receptor alpha chain variable domain. 

A;Reference number: A41912; MUID: 92298397 ; PMID: 1318787 



A /Access ion: D41912 

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: mRNA 
A; Residues: 1-28 <HON> 

C;Superfamily: immunoglobulin V region; immunoglobulin homology 
C;Keywords: T-cell receptor 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 11 DLR 13 

III 

Db 25 DLR 27 



RESULT 24 
150169 

alpha-1 type-1 collagen - chicken (fragment) 
C; Species: Gallus gallus (chicken) 

C;Date: 13-Sep-1996 #sequence_revision 13-Sep-1996 #text_change 13-Aug-1999 
C; Accession: 150169 

R ; Yamada , Y . ; Mudry j , M . ; de Crombrugghe , B . 
J. Biol. Chem. 258, 14914-14919, 1983 

A; Title: A uniquely conserved regulatory signal is found around the translation 
initiation site in three different collagen genes. 
A;Reference number: 150169; MUID: 84087884 ; PMID:6689169 
A; Access ion: 150169 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-28 <YAM> 

A; Cross-references: GB:K01482; NID:g211287; PIDN: AAA48 636 . 1 ; PID:g555426 
C;Superfamily: collagen alpha 1(1) chain; fibrillar collagen carboxyl -terminal 
homology; von Willebrand factor type C repeat homology 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 21 LIA 23 

III 

Db 12 LIA 14 



RESULT 2 5 
S21231 

calcium-binding protein SCP VI, sarcoplasmic - common lancelet 
C; Species: Branchiostoma lanceolatum (common lancelet) 

C;Date: 19-Mar-1997 #sequence_revision 25-Apr-1997 #text_change 02-Aug-2002 
C;Accession: S21231 

R;Takagi, T.; Valette-Talbi, L.; Cox, J, A. 
FEBS Lett. 302, 159-160, 1992 

A; Title: Primary structure of three minor isoforms of amphioxus sarcoplasmic 
calcium-binding proteins. 

A;Reference number: S21153; MUID: 92339504 ; PMID: 1633848 
A;Accession: S21231 
A;Molecule type: protein 



A;Residues: 1-28 <TAK> 

C; Super family : calmodulin; calmodulin repeat homology 
C; Keywords: calcium binding; EF hand 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



Db 



16 FFL 18 

III 
5 FFL 7 



RESULT 26 
A05296 

fibrinogen alpha chain - dog (fragment) 
C; Species: Canis lupus familiaris (dog) 

C;Date: 05-Jun-1987 #sequence_revision 05-Jun-1987 #text_change 25-Oct-1996 
C;Accession: A94308; A0311B; A37511; A05296; B37511; C03118 
R;Birken, S.; Wilner, G.D.; Canfield, R.E. 
Thromb. Res. 1, 599-610, 1975 

A; Title: Studies of the structure of canine fibrinogen. 

A;Reference number: A94308; MUID: 76081726 ; PMID: 1198547 

A; Access ion: A94308 

A; Molecule type: protein 

A; Residues: 1-28 <BIR> 

R;Blombaeck, B.; Blombaeck, M. ; Groendahl , M.J. 

Acta Chem. Scand. 19, 1789-1791, 1965 

A; Title: Studies on f ibrinopeptides from mammals. 

A; Reference number: A03118 

A;Accession: A03118 

A; Molecule type: protein 

A; Residues: 1-16 <BLO> 

R;Osbahr Jr., A.J,; Colman, R.W.; Laki , K. ; Gladner, J. A. 

Biochem. Biophys. Res. Commun. 14, 555-558, 1964 

A;Reference number: A37511; MUID: 66020594 ; PMID:5836555 

A;Accession: A37511 

A; Molecule type: protein 

A;Residues: 1, 'D' ,3, 'EGKQ' ,8-16 <OSB> 

C; Superfamily : fibrinogen alpha chain; fibrinogen disulfide ring homology 

C; Keywords: blood coagulation; liver; phosphoprotein; plasma 

F; 1-16/Product : f ibrinopept ide A #status experimental <APT> 

F;3/Binding site: phosphate (Ser) (covalent) (partial) #status experimental 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 22 lAE 24 

III 

Db 9 lAE 11 



RESULT 27 
B35577 

cell adhesion receptor CD36 - bovine (fragment) 

N;Alternate names: membrane glycoprotein PAS IV; platelet glycoprotein IV 



C; Species: Bos primigenius taurus (cattle) 

C;Date: 21-Sep-1990 #sequence_revision lO-May-1996 #text_change 29-Aug-1997 
C;Accession: B35577; A37884 

R;Greenwalt, D.E.; Watt, K.W.K.; So, O.Y.; Jiwani, N. 
Biochemistry 29, 7054-7059, 1990 

A; Title: PAS IV, an integral membrane protein of mammary epithelial cells, is 

related to platelet and endothelial cell CD36 (GP IV) . 

A;Reference number: A35577; MUID : 91027734 ; PMID: 1699598 

A;Accession: B35577 

A; Molecule type: protein 

A;Residues: 1-28 <GRE> 

R;Greenwalt , D.E.; Watt, K.W.K.; Hasler, T. ; Howard, R. J. ; Patel , S. 
J. Biol. Chem. 265, 16296-16299, 1990 

A;Title: Structural, functional, and antigenic differences between bovine heart 

endothelial CD36 and human platelet CD36. 

A/Reference number: A37884; MUID: 90375491 ; PMID: 1697855 

A;Accession: A37884 

A; Molecule type: protein 

A;Residues: 1-21 <GR2> 

C; Superf amily : lysosomal integral membrane protein II 

C;Keywords: cell adhesion; glycoprotein; receptor; surface antigen; 

transmembrane protein 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



Db 



21 LIA 23 

III 
8 LIA 10 



RESULT 28 
PN0047 

signal transduction protein QM0017 - mouse (fragments) 
C;Species: Mus musculus (house mouse) 

C;Date: 29-Oct-1997 #sequence_revision 29-Oct-1997 #text_change 15-Oct-1999 
C;Accession: PN0047 
R;Kato, H. 

Kawasaki Igakkaishi 22, 245-259, 1996 

A; Title: Analysis of proteins isolated by two dimensional electrophoresis of 

mouse neuroblastoma cells. 

A; Reference number: PN0041 

A; Access ion: PN0047 

A;Molecule type: protein 

A;Residues: 1-28 <KAT> 

A; Experimental source: neuroblastoma cell 

C; Comment: The molecular mass is 25,600 and the pi is 6.07. The amino -terminus 
is blocked. 

C; Superf amily : signal transduction protein DJ-1 
C ; Keywords : bra in 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



4 NLG 6 



Db 



RESULT 2 9 
148178 

orphan receptor - rat (fragment) 

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 24-Sep-1999 
C; Accession: 148178 

R;Pena de Ortiz, S.; Cannon, M.M.; Jamieson, G.A. 
Brain Res. Mol , Brain Res. 23, 278-283, 1994 

A; Title: Expression of nuclear hormone receptors within the rat hippocampus 

identification of novel orphan receptors. 

A/Reference number: 148178; MUID: 94335560; PMID: 7914660 

A; Access ion: 148178 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-28 <RES> 

A; Cross-references: GB:L19344; NID:g349093; PIDN : AAB463 96 . 1 ; PID:g565378 
C; Super family: unassigned erbA-related proteins; erbA transforming protein 
homology 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Fred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 8 SIQ 10 

Ml 

Db 6 SIQ 8 



RESULT 3 0 
B56779 

tetM 5 ' -region leader peptide - Enterococcus faecalis transposon Tn916 
C; Species: Enterococcus faecalis 

C;Date: 08-Sep-1995 #sequence_revision 08-Sep-1995 #text_change 24-Sep-1999 

C; Access ion: B56779 

R;Su, Y.A.; He, P.; Clewell, D.B. 

Antimicrob. Agents Chemother. 36, 769-778, 1992 

A; Title: Characterization of the tet(M) determinant of Tn916: evidence for 
regulation by transcription attenuation. 

A;Reference number: A56779; MUID : 92368175 ; PMID: 1323953 
A; Accession: B56779 
A; Status : prel iminary 
A; Molecule type: DNA 
A;Residues: 1-28 <SU1> 

A; Cross-references: GB:U09422; NID:g532533; PIDN: AAB60021 . 1 ; PID:g532545 
A;Note: sequence extracted from NCBI backbone (NCBIN: 111076 , NCBIP : 111079) 
C;Superfamily : unassigned leader peptides 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 7 KSI 9 

III 



Db 



15 KSI 17 



RESULT 31 
G69384 

conserved hypothetical protein AF1079 - Archaeoglobus fulgidus 
C;Species: Archaeoglobus fulgidus 

C;Date: 05-Dec-1997 #sequence_revision 05-Dec-1997 #text_change 22-Oct-1999 
C;Accession: G69384 

R;Klenk, H.P.; Clayton, R.A.; Tomb, J.F.; White, 0.; Nelson, K.E,; Ketchum, 
K.A. ; Dodson, R.J,; Gwinn, M.; Hickey, Peterson, J.D.; Richardson, D.L.; 

Kerlavage, A.R.; Graham, D.E.; Kyrpides, N,C.; Fleischmann, R.D.; Quackenbush, 
J.; Lee, N.H.; Sutton, G.G.; Gill, S.; Kirkness, E.F.; Dougherty, B.A.; McKenny, 
K. ; Adams, M.D.; Loftus, B.; Peterson, S.; Reich, C.I.; McNeil, L.K.; Badger, 
J.H.; Glodek, A.; Zhou, L. ; Overbeek, R.; Gocayne, J.D.; Weidman, J.F,; 
McDonald, L. 

Nature 390, 364-370, 1997 

A;Authors: Utterback, T. ; Cotton, M.D.; Spriggs, T.; Artiach, P.; Kaine, B.P.; 
Sykes, S.M,; Sadow, P.W.; D'Andrea, K.P.; Bowman, C; Fujii, C. ; Garland, S.A.; 
Mason, T.M.; Olsen, G. J. ; Fraser, CM.; Smith, H.O. ; Woese, C.R.; Venter, J.C. 
A;Title: The complete genome sequence of the hyperthermophilic, sul fate-reducing 
archaeon Archaeoglobus fulgidus . 

A;Reference number: A69250; MUID: 98049343 ; PMID: 9389475 
A;Accession: G69384 

A;Status: preliminary; nucleic acid sequence not shown; translation not shown 
A;Molecule type: DNA 
A;Residues: 1-28 <KLE> 

A; Cross -references: GB:AE001028; GB:AE000782; NID : g2689351 ; PIDN: AAB90160 . 1 ; 
PID:g2649506; TIGR:AF1079 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 11 DLR 13 

111 

Db 21 DLR 23 



RESULT 32 
A69259 

hypothetical protein AF0073 - Archaeoglobus fulgidus 
C; Species : Archaeoglobus fulgidus 

C;Date: 05-Dec-1997 #sequence_revision 05-Dec-1997 #text_change 22-Oct-1999 
C;Accession: A69259 

R;Klenk, H.P.; Clayton, R.A.; Tomb, J.F.; White, O. ; Nelson, K.E.; Ketchum, 
K.A. ; Dodson, R.J.; Gwinn, M.; Hickey, E.K,; Peterson, J.D.; Richardson, D.L.; 
Kerlavage, A.R.; Graham, D.E.; Kyrpides, N.C; Fleischmann, R.D.; Quackenbush, 
J.; Lee, N.H.; Sutton, G.G.; Gill, S.; Kirkness, E.F.; Dougherty, B.A.; McKenny, 
K. ; Adams, M.D.; Loftus, B.; Peterson, S.; Reich, C.I.; McNeil, L.K.; Badger, 
J.H.; Glodek, A.; Zhou, L.; Overbeek, R.; Gocayne, J.D.; Weidman, J.F.; 
McDonald, L. 

Nature 390, 364-370, 1997 

A;Authors: Utterback, T. ; Cotton, M.D.; Spriggs, T. ; Artiach, P.; Kaine, B.P.; 
Sykes, S.M.; Sadow, P.W. ; D'Andrea, K.P.; Bowman, C. ; Fujii, C. ; Garland, S.A.; 
Mason, T.M. ; Olsen, G. J. ; Fraser, CM.; Smith, H.O. ; Woese, CR.; Venter, J.C. 



A;Title: The complete genome sequence of the hyperthermophilic, sulf ate-reducing 
archaeon Archaeoglobus f ulgidus . 

A;Reference number: A69250; MUID: 98049343 ; PMID:9389475 
A;Accession: A69259 

A;Status: preliminary; nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A;Residues: 1-28 <KLE> 

A; Cross -references: GBiAEOOllOl; GB:AE000782; NID:g2689424 ; PIDN : AAB91159 , 1 ; 
PID:g2650579; TIGR:AF0073 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Fred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



23 AEI 25 
III 

17 AEI 19 



RESULT 33 
T14905 

hypothetical protein - parsley 

C; Species: Petroselinum crispum (parsley) 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 20-Sep-1999 
C;Accession: T14905 

R;Feldbrugge, M.; Sprenger, M.; Dinkelbach, M. ; Yazaki, K. ; Harter, K. ; 
Weisshaar, B. 

Plant Cell 6, 1607-1621, 1994 

A;Title: Functional analysis of a light -responsive plant bZIP transcriptional 
regulator. 

A;Reference number: Z18259; MUID: 95128172 ; PMID:7827494 
A;Accession: T14905 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-28 <FEL> 

A; Cross-references: EMBL:S75395; NID:g913201; PID:el94726 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 

Db 



1 LLH 3 
III 

14 LLH 16 



RESULT 34 
JQ0272 

hypothetical 3K protein (trnH-trnV intergenic region) - rice chloroplast 
C; Species: chloroplast Oryza sativa (rice) 

C;Date: 31-Mar-1990 #sequence_revision 31-Mar-1990 #text_change 21-Jul-2000 
C;Accession: JQ0272; S05152 

R;Shimada, H. ; Whittier, R.F.; Hiratsuka, J.; Maeda, Y.; Hirai, A.; Sugiura, M. 
submitted to JIPID, December 1989 
A; Reference number: JQ02 00 
A;Accession: JQ0272 
A; Molecule type: DNA 



A;Residues: 1-28 <SHI> 

A; Experimental source: cv. Nihonbare 

R;Hiratsuka, J.; Shimada, H. ; Whittier, R, ; Ishibashi, T. ; Sakamoto, M.; Mori, 
M.; Kondo, C; Honji, Y. ; Sun, C.R.; Meng, B.Y.; Li, Y.Q. ; Kanno, A.; Nishizawa, 
Y.; Hirai, A.; Shinozaki, K. ; Sugiura, M. 
Mol. Gen. Genet. 217, 185-194, 1989 

A;Title: The complete sequence of the rice (Oryza sativa) chloroplast genome: 
intermolecular recombination between distinct tRNA genes accounts for a major 
plastid DNA inversion during the evolution of the cereals. 
A;Reference number: S05080; MUID : 89364698 ; PMID : 2770692 
A;Accession: S05152 

A;Status: nucleic acid sequence not shown; translation not shown 
A /Molecule type: DNA 
A;Residues: 1-28 <HIR> 

A; Cross -references: GB:X15901; NID:gll957; PIDN: CAA33 939 . 1 ; PID:gl2032 

A;Note: this sequence was submitted to EMBL, July 1989 

C; Genetics : 

A ; Genome : ch 1 crop last 

C; Keywords : chloroplast 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



19 HHL 21 

Ml 

19 HHL 21 



RESULT 35 
S07826 

venom protein - American tarantula (Eurypelma californica) (fragment) 
C; Species: Eurypelma californica 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 18-Jun-1993 
C; Accession: S07826 
R;Savel -Niemann, A. 

Biol. Chem. Hoppe-Seyler 370, 485-498, 1989 

A;Title: Tarantula (Eurypelma californicum) venom, a multi component system. 

A;Reference number: S04224; MUID: 89302691 ; PMID:2742756 

A; Accession: S07826 

A;Molecule type: protein 

A /Residues: 1-28 <SAV> 

C ; Keywords : venom 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKS 8 

III 

Db 20 GKS 22 



RESULT 36 
154183 

cell adhesion regulator - human (fragment) 
C; Species: Homo sapiens (man) 



C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 21-Jul-2000 

C; Access ion: 154183 

R ; Koyama , K . ; Emi , M . ; Nakamura , Y . 

Genomics 16, 264-265, 1993 

A;Title: The cell adhesion regulator (CAR) gene; TaqI and insertion/deletion 
polymorphism, and regional assignment to the peritelomeric region of 16q by 
linkage analysis. 

A;Reference number: 154183; MUID: 93252394 ; PMID: 8098008 
A; Access ion: 154183 

A; status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-28 <RES> 

A; Cross-references: GB:D14075; NID:g219525; PIDN : BAA03160 . 1 ; PID:g219526 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 
Db 



12 LRR 14 
III 

14 LRR 16 



RESULT 37 
JQ1035 

hypothetical 3. 2K protein (type I IGFR 5' region) - human 
C; Species: Homo sapiens (man) 

C;Date: 16-Jul-1999 #sequence_revision 16-Jul-1999 #text_change 16-Jul-1999 
C; Access ion: JQ1035 

R;Cooke, D.W.; Bankert , L.A. ; Roberts Jr., C.T.; LeRoith, D. ; Casella, S.J. 
Biochem. Biophys . Res. Commun. 177, 1113-1120, 1991 

A;Title: Analysis of the human type I insulin-like growth factor receptor 
promotor region. 

A;Reference number: PQ0159; MUID : 91282751 ; PMID: 1711844 
A; Access ion: JQ1035 
A; Status: preliminary 
A; Molecule type: DNA 
A; Residues: 1-28 <C00> 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



12 LRR 14 
III 

18 LRR 20 



RESULT 38 
S51593 

myrB protein - Micromonospora griseorubida (fragment) 
C; Species: Micromonospora griseorubida 

C;Date: 19-Mar-1997 #sequence_revision 18-Jul-1997 #text_change 05-Nov-1999 
C;Accession: S51593 

R;Inouye, M. ; Takada, Y, ; Muto, N.; Beppu, T.; Horinouchi, S. 
Mol. Gen. Genet. 245, 456-464,, 1994 



A;Title: Characterization and expression of a P-450-like mycinamicin 
biosynthesis gene using a novel Micromonospora-Escherichia coli shuttle cosmid 
vector. 

A/Reference number: S51593; MUID : 95107242 ; PMID: 7808395 
A; Access ion: S51593 
A; Molecule type: DNA 
A;Residues: 1-28 <INO> 

A; Cross-references: EMBL: D16098 ; NID:g286050; PIDN : BAA03671 . 1 ; FID : dl004 186 ; 

PID:g829046 

C;Genetics: 

A; Gene: myrB 

Query Match 10.7%; Score 3; DB 2; Length 28; 

Best Local Similarity 100.0%; Pred. No. l,le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 13 RRR 15 

III 

Db 22 RRR 24 



RESULT 39 
JN0014 

GABA(A) receptor alpha-1 chain /glycine receptor alpha-1 chain engineered fusion 
protein - synthetic (fragment) 
C; Species : synthetic 

A;Note: Rattus norvegicus (Norway rat) gene 5' region with engineered signal 
sequence derived from GABA(A) receptor 

C;Date: 31-Mar-1990 #sequence_revision 30-Apr-1998 #text_change 13-Aug-1999 
C;Accession: JN0014 

R;Sontheimer, H. ; Becker, CM.; Pritchett, D,B.; Schofield, P.R. ; Grenningloh, 
G.; Kettenmann, H.; Betz, H.; Seeburg, P.H. 
Neuron 2, 1491-1497, 1989 

A; Title: Functional chloride channels by mammalian cell expression of rat 
glycine receptor subunit . 

A;Reference number: JN0014; MUID: 90180468 ; PMID:2483325 

A;AccesSion: JN0014 

A; Molecule type: mRNA 

A;ResidueS: 1-28 <SON> 

C; Comment: See PIR:S2 0662. 

F; 1-20/Region: GABA(A) receptor alpha-1 chain-derived 
F; 21-28/Region: glycine receptor alpha-1 chain-derived 

Query Match 10.7%; Score 3; DB 4; Length 28; 

Best Local Similarity 100.0%; Pred. No. l.le+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 6 GKS 8 

III 

Db 2 GKS 4 



RESULT 4 0 
A55527 

pyrroloquinoline quinone precursor pqqD - Methyl obacterium extorquens 
C; Species: Methylobacterium extorquens 

C;Date: 18-Feb-2000 #sequence_revision 18-Feb-2000 #text_change 18-Feb-2000 



C; Access ion: A55527 

R/Morris, C.J.; Biville, F. ; Turlin, E.; Lee, E, ; Ellermann, K. ; Fan, W,H.; 
Ramamoorthi, R.; Springer, A.L. ; Lidstrom, iyi,E, 
J. Bacteriol. 176, 1746-1755, 1994 

A;Title: Isolation, phenotypic characterization, and complementation analysis of 
mutants of Methyl obacterium extorquens AMI unable to synthesize pyrrol oquinoline 
quinone and sequences of pqqD, pqqG, and pqqC. 
A/Reference number: A55527; MUID: 94179111 ; PMID: 8132470 
A;Accession: A55527 
A; Status : preliminary 
A; Molecule type: DNA 
A; Residues: 1-29 <MOR> 

A; Cross -references: GB:L25889; NID : g41458 9 ; PIDN : AAA17878 . 1 ; PID:g414590 
C; Genetics: 
A; Gene: pqqD 

C;Superfamily : pyrroloquinoline quinone precursor pqqA 
C; Keywords : quinoprotein 

F; 16, 20/Product : pyrroloquinoline quinone #status predicted <MAT> 

F; 16-20/Cross-link: pyrroloquinoline quinone (Glu, Tyr) #status predicted 

Query Match 10.7%; Score 3; DB 1; Length 29; 

Best Local Similarity 100.0%; Pred. No, 1.2e+04; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 
Db 



23 AEI 25 
III 

23 AEI 25 



Search completed: January 14, 2004, 10:37:32 
Job time : 11.2866 sees 
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Title: 

Perfect score: 
Sequence: 

Scoring table: 
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(without alignments) 
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8 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 
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10: / cgn2_6 /p t oda t a / 1 /pubpaa /US 0 9B_PUBC0MB . pep 
11 : /cgn2_6/ptodata /I /pubpaa /US09C_PUBCOMB. pep 
12 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: 
13 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep 
14 : /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB .pep 
15 : /cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep 
16 : /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep 
17 : /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB .pep 
18 : /cgn2_6/ptodata/l/pubpaa/US60_PUBC0MB.pep 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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195 


5 


17. 


.9 


33 


12 


US-10-300-083-14 


Sequence 


14, Appl 


196 


5 


17. 


.9 


33 


12 


US-10-300-083-15 


Sequence 


15, Appl 


197 


5 


17, 


.9 


33 


12 


US-10-300-083-16 


Sequence 


16, Appl 


198 


5 


17 , 


. 9 


33 


12 


US-10-300-083-17 


Sequence 


17, Appl 


199 


5 


17 


. 9 


33 


12 


US-10-300-083-18 


Sequence 


18, Appl 


200 


5 


17 


. 9 


33 


12 


US-10-300-083-20 


Sequence 


20, Appl 


201 


5 


17 


.9 


33 


12 


US-10-300-083-21 


Sequence 


21, Appl 


202 


5 


17 


.9 


33 


12 


US-10-300-083-22 


Sequence 


22 , Appl 


203 


5 


17 


.9 


33 


12 


US-10-300-083-23 


Sequence 


23, Appl 


204 


5 


17 


.9 


33 


12 


US-10-300-083-26 


Sequence 


26, Appl 


205 


5 


17 


.9 


33 


12 


US-10-300-083-27 


Sequence 


27, Appl 


206 


5 


17 


.9 


33 


12 


US-10-300-083-28 


Sequence 


28, Appl 


207 


5 


17 


.9 


33 


12 


US-10-300-083-29 


Sequence 


29, Appl 


208 


5 


17 


.9 


33 


12 


US-10-300-083-30 


Sequence 


30, Appl 


209 


5 


17 


. 9 


34 


9 


US-09-169-786-3 


Sequence 3, Appli 


210 


5 


17 


,9 


34 


10 


US-09-928-047B-6 


Sequence 


6 , Appl i 


211 


5 


17 


,9 


34 


11 


US-09-843-221A-16 


Sequence 


16, Appl 


212 


5 


17 


,9 


34 


11 


US-09-843-221A-19 


Sequence 


19, Appl 


213 


5 


17 


-9 


34 


11 


US-09-843-221A-20 


Sequence 


20, Appl 


214 


5 


17 


.9 


34 


11 


US-09-843-221A-22 


Sequence 


22, Appl 


215 


5 


17 


.9 


34 


11 


US-09-843-221A-24 


Sequence 


24, Appl 


216 


5 


17 


.9 


34 


11 


US-09-843-221A-25 


Sequence 


25, Appl 


217 


5 


17 


.9 


34 


11 


US-09-843-221A-88 


Sequence 


88, Appl 


218 


5 


17 


.9 


34 


11 


US-09-843-221A-89 


Sequence 


89, Appl 


219 


5 


17 


.9 


34 


11 


US-09-843-221A-90 


Sequence 


90, Appl 


220 


5 


17 


.9 


34 


11 


US-09-843-221A-91 


Sequence 


91, Appl 


221 


5 


17 


,9 


34 


11 


US-09-843-221A-92 


Sequence 


92, Appl 


222 


5 


17 


.9 


34 


11 


US-09-843-221A-122 


Sequence 


122, App 


223 


5 


17 


.9 


34 


11 


US-09-843-221A-123 


Sequence 


123, App 


224 


5 


17 
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34 


11 


US-09-843-221A-128 


Sequence 


128, App 


225 


5 


17 


,9 


34 


11 


US-09-843-221A-161 


Sequence 


161, App 


226 


5 


17 


.9 


34 


11 


US-09-843-221A-164 


Sequence 


164, App 


227 


5 


17 


.9 


34 


12 


US-09-928-048A-6 


Sequence 


6, Appli 



228 
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17 


9 


34 


12 


US-10-361-928-1 


Sequence 1, Appli 


229 


5 


17 


9 


34 


12 


US-10-361-928-2 


Sequence 2, Appli 


230 


5 


17 


9 


34 


12 


US -10-361- 92 8-5 


Sequence 5; Appli 


231 
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17 


9 


34 


12 


US-10-361-928-8 


Sequence 8 , Appl i 
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17 
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US -10-3 72 -095-22 


Sequence 22, Appl 
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17 


9 


34 


12 


US- 10 -3 72 -095 -24 


Sequence 24 , Appl 


234 


5 


17 


9 


34 


12 


US -10-340-484-15 


Sequence 15, Appl 
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5 


17 


9 


34 


12 


US -10-340-484-16 


Sequence 16, Appl 


226 


5 


17 


9 


34 


12 


US -10-340-484-17 


Sequence 17, Appl 


237 


5 


17 


9 


34 


12 


US -10-340-484-18 


Sequence 18, Appl 
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5 


17 


9 


34 
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US -10-340-484-19 


Sequence 19, Appl 


239 
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17 


9 


34 
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us -10-340-484-20 


Sequence 20, Appl 
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17 
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US -10-340-484-22 


Sequence 22, Appl 
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17 


9 


34 


12 


US -10-340-484-23 


Sequence 23, Appl 
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17 


9 
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14 


US -10-016-403 -5 


Sequence 5, Appli 


243 
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9 


34 


14 


US-10-016-403-6 


Sequence 6, Appli 
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17 
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US -10-016-403-7 


Sequence 7, Appli 
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US -10-097-079-1 


Sequence 1, Appli 
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17 


9 


34 


14 


US -10-097- 07 9-46 


Sequence 46, Appl 
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17 


9 


34 


14 


US-10-097-079-75 


Sequence 75, Appl 
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US -09 -843 -22 lA- 15 


Sequence 15, Appl 
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Sequence 9, Appli 
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17 


9 
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US -09-169-786-4 


Sequence 4, Appli 
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17 
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US - 09-843-22 lA -14 


Sequence 14, Appl 
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Sequence 11, Appl 
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Sequence 1, Appli 
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Sequence 111, App 
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US -09-8 64 -761-35098 


Sequence 35098, A 
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3 


28 
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US -09-8 64 -761-44536 


Sequence 44536, A 


257 
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28 


10 


US - 09-071-838-97 


Sequence 97, Appl 
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US-09-790-622-9 


Sequence 9, Appli 
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Sequence 662, App 


250 
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14 


3 
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Sequence 662, App 
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4 


14 


3 


28 


15 


US-10 -213 -512-97 


Sequence 97, Appl 
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Sequence 9, Appli 
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Sequence 42588, A 
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Sequence 638, App 
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Sequence 152, App 
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US- 09 -843 -22 lA- 152 


Sequence 152, App 


O ""7 

267 


4 


14 


3 


29 


12 


US-10 -351 -641-379 


Sequence 379, App 


268 


4 


14 
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12 


US -10-351- 64 1-590 


Sequence 590, App 


269 
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14 
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29 


12 


US-10 -351- 641-596 


Sequence 596, App 


270 


4 


14 


3 


29 


12 


US -10-351- 64 1-1 127 


Sequence 1127, Ap 


271 


4 


14 


3 


29 


12 


US -10- 22 7-577-638 


Sequence 638, App 


272 


4 


14 


3 


29 


12 


US -10-2 64 -04 9-3239 


Sequence 323 9, Ap 
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14 


3 


29 


15 


US -10-214-188-11 


Sequence 11, Appl 


274 


4 


14 


3 


29 


15 


US-10 -092 -750 -45 


Sequence 45, Appl 


275 


4 


14 


3 


29 


15 


US-10-000-256A-218 


Sequence 218, App 
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Sequence 638, App 
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US-09-864-761-40825 


Sequence 4 0825, A 


278 


4 


14, 


3 


30 


9 


US-09-878-905-5 


Sequence 5, Appli 


279 
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14 


3 


30 


9 


US-09-878-905-7 


Sequence 7, Appli 
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14. 


3 


30 


9 


US-09-764-860-456 


Sequence 456, App 


281 


4 


14. 


3 


30 


11 


US-09-910-180-16 


Sequence 16, Appl 


282 


4 


14. 


3 


30 


12 


US-10-351-641-1521 


Sequence 1521, Ap 


283 


4 


14. 


3 


30 


12 


US-10-029-386-30521 


Sequence 30521, A 


284 


4 


14. 


3 


30 


12 


US-10-212-872-456 


Sequence 456, App 



285 


4 


14. 


3 


30 


12 


US-10-264-049-3744 


Sequence 3744, Ap 


286 


4 


14 . 


3 


30 


15 


US-10-074-095-456 


Sequence 456, App 


287 
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14 . 


3 


31 


9 


US- 09 -864 -761-33755 


Sequence 33755, A 


288 


4 


14 . 


3 


31 


9 


US-09-864-761-40876 


Sequence 4 0876, A 


289 


4 


14 . 


3 


31 


10 


US-09-840-277-55 


Sequence 55, Appl 


290 


4 


14 . 


3 


31 


10 


US-09-840-277-89 


Sequence 89, Appl 


291 


4 


14 . 


3 


31 


11 


US-09-932-367A-29 


Sequence 29, Appl 


292 


4 


14 . 


3 


31 


11 


US-09-932-367A-33 


Sequence 33, Appl 


293 


4 


14 . 


3 


31 


11 


US-09-809-391-648 


Sequence 648, App 


294 


4 


14 . 


3 


31 


12 


US-09-882-171-648 


Sequence 648, App 


295 


4 


14. 


3 


31 


12 


US-10-351-641-589 


Sequence 589, App 


296 


4 


14 . 


3 


31 


12 


US-10-351-641-595 


Sequence 595, App 


297 


4 


14. 


,3 


31 


12 


US-10-062-599-158 


Sequence 158, App 


298 


4 


14 . 


,3 


31 


12 


US-10-213-053A-11 


Sequence 11, Appl 


299 


4 


14 . 


,3 


31 


12 


US-10-269-695-344 


Sequence 344, App 


300 


4 
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.3 


31 


12 


US-10-410-998-344 


Sequence 344, App 


301 


4 


14 , 


.3 


31 


14 


US-10-097-079-11 


Sequence 11, Appl 


302 


4 


14. 


.3 


31 


14 


US-10-097-079-15 


Sequence 15, Appl 


303 


4 


14 . 


.3 


31 


14 


US-10-097-079-28 


Sequence 28, Appl 


304 


4 


14 . 


.3 


31 


14 


US-10-097-079-31 


Sequence 31, Appl 


305 


4 


14 . 


.3 


31 


14 


US-10-097-079-40 


Sequence 40, Appl 


306 


4 


14, 


.3 


31 


14 


US-10-097-079-88 


Sequence 88, Appl 


307 


4 


14 , 


.3 


31 


15 


US-10-082-830-189 


Sequence 18 9, App 


308 


4 


14 , 


.3 


31 


15 


US-10-002-884A-4 


Sequence 4, Appli 


309 


4 


14, 


.3 


31 


15 


US-10-062-831-158 


Sequence 158, App 


310 


4 


14 


.3 


32 


9 


US-09-864-761-38615 


Sequence 38615, A 


311 


4 


14 


.3 


32 


9 


US-09-864-761-39194 


Sequence 3 9194, A 


312 


4 


14 


.3 


32 


12 


US-10-316-253-211 


Sequence 211, App 


313 


4 


14 


.3 


32 


12 


US-10-029-386-31729 


Sequence 31729, A 


314 


4 


14 


.3 


32 


14 


US-10-060-714-9 


Sequence 9, Appli 


315 


4 


14 


,3 


32 


15 


US-10-174-410-200 


Sequence 2 00, App 


316 


4 


14 


.3 


32 


15 


US-10-174-410-209 


Sequence 2 09, App 


317 


4 


14 


.3 


32 


15 


US-10-174-410-243 


Sequence 243, App 


318 


4 


14 


.3 


32 


15 


US-10-174-410-277 


Sequence 277, App 


319 


4 


14 


.3 


32 


15 


US-10-002-884A-1 


Sequence 1, Appli 


320 


4 


14 


.3 


33 


9 


US-09-864-761-48332 


Sequence 48332, A 


321 


4 


14 


.3 


33 


9 


US-09-864-761-49019 


Sequence 4 9019, A 


322 


4 


14 


.3 


33 


10 


US-09-785-546-4 


Sequence 4, Appli 


323 


4 


14 


.3 


33 


10 


US-09-785-546-5 


Sequence 5, Appli 


324 


4 


14 


.3 


33 


11 


US-09-843-221A-116 


Sequence 116, App 


325 


4 


14 


.3 


33 


12 


US-10-289-135A-37 


Sequence 37, Appl 


326 


4 


14 


.3 


33 


12 


US-10-351-641-588 


Sequence 588, App 


327 


4 


14 


.3 


33 


12 


US-10-351-641-594 


Sequence 5 94, App 


328 


4 


14 


.3 


33 


12 


US-10-189-437-664 


Sequence 664, App 


329 


4 


14 


.3 


34 


9 


US-09-776-490-26 


Sequence 26, Appl 


330 


4 


14 


.3 


34 


9 


US-09-776-491-26 


Sequence 26, Appl 


331 


4 


14 


.3 


34 


9 


US-09-864-761-35232 


Sequence 35232, A 


332 


4 


14 


.3 


34 


9 


US-09-864-761-42358 


Sequence 42358, A 


333 


4 


14 


.3 


34 


9 


US-09-864-761-43442 


Sequence 43442, A 


334 


4 


14 


.3 


34 
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US-09-864-761-44916 


Sequence 44 916, A 


335 


4 


14 
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34 
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US- 09 -8 64 -761 -452 81 


Sequence 45281, A 
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14 


.3 


34 


9 


US-09-864-761-46156 


Sequence 46156, A 


337 
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14 


.3 


34 
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US-09-917-340-44 


Sequence 44, Appl 


338 


4 


14 


.3 


34 


9 


US-09-917-340-83 


Sequence 83, Appl 


339 


4 


14 


.3 


34 


10 


US-09-785-546-3 


Sequence 3, Appli 


340 


4 


14 


.3 


34 


12 


US-09-840-743-75 


Sequence 75, Appl 


341 


4 


14 


.3 


34 


12 


US-10-195-730-256 


Sequence 256, App 
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14 . 


3 


34 


12 


US-09-765-815-7 


Sequence 


1, Appli 
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14 , 


3 


34 


12 


US-09-840-085-53 


Sequence 


53, Appl 
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34 


12 


US-09-840-085-54 


Sequence 


54, Appl 
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3 


34 


12 


US-09-840-085-55 


Sequence 


55, Appl 


346 


4 


14 . 


3 


34 


12 


US-09-840-085-56 


Sequence 


56, Appl 


347 
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14 . 


3 


34 


12 


US-10-340-484-24 


Sequence 


24, Appl 


348 
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3 


34 


12 


US-09-933-767-1138 


Sequence 


1138, Ap 


349 
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34 


12 


US-10-351-641-932 


Sequence 


932, App 


350 
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14 . 


3 


34 


12 


US-10-029-386-29690 


Sequence 


29690, A 


351 
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3 


34 
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US-10-023-282-1138 


Sequence 


1138, Ap 
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US-09-864-761-44061 


Sequence 44 061, A 
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35 
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US-09-864-761-45676 


Sequence 45676, A 
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US-10-012-952A-147 


Sequence 


147, App 
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12 


US-10-351-641-107 


Sequence 


107, App 


356 
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14 . 


,3 


35 


12 


US-10-351-641-108 


Sequence 


108, App 


357 


4 


14 . 


,3 


35 


12 


US-10-351-641-109 


Sequence 


109, App 
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12 


US-10-351-641-110 


Sequence 


110, App 
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35 


12 


US-10-351-641-111 


Sequence 


111, App 
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35 


12 


US-10-351-641-112 


Sequence 


112, App 
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12 


US-10-351-641-113 


Sequence 


113, App 


362 


4 
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12 


US-10-351-641-114 


Sequence 


114, App 
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12 


US-10-351-641-115 


Sequence 


115, App 
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14 , 
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12 


US-10-351-641-679 


Sequence 


679, App 
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14 , 
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35 


12 


US-10-351-641-680 


Sequence 


68 0, App 


366 


4 


14 , 
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US-10-351-641-681 


Sequence 


681, App 
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US-10-351-641-682 


Sequence 


682, App 
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12 


US-10-351-641-684 


Sequence 


684, App 
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12 


US-10-351-641-685 


Sequence 


685, App 
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US-10-351-641-686 


Sequence 


686, App 


371 


4 


14 


.3 


35 


12 


US-10-351-641-689 


Sequence 


689, App 
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US-10-351-641-693 


Sequence 


693, App 
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US-10-351-641-706 


Sequence 


706, App 
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US-10-351-641-707 


Sequence 


707, App 


375 


4 


14 


.3 


35 


12 


US-10-351-641-708 


Sequence 


708, App 


376 
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US-10-351-641-709 


Sequence 


709, App 
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US-10-351-641-710 


Sequence 


710, App 
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US-10-351-641-711 


Sequence 


711, App 
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US-10-351-641-712 


Sequence 


712, App 


380 


4 


14 


.3 


35 


12 


US-10-351-641-713 


Sequence 
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US-10-351-641-714 


Sequence 


714, App 
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US-10-351-641-715 


Sequence 


715, App 
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US-10-351-641-718 


Sequence 


718, App 
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Sequence 


726, App 
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US-10-351-641-740 


Sequence 
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US-10-351-641-759 


Sequence 


759, App 
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US-10-351-641-766 


Sequence 


766, App 
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US-10-351-641-767 


Sequence 


767, App 
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Sequence 


768, App 
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Sequence 
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Sequence 


832, App 
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Sequence 
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US-10-351-641-917 


Sequence 


917, App 
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US-10-351-641-918 


Sequence 


918, App 
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Sequence 


924, App 
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US-10-351-641-1524 


Sequence 
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US-10-351-641-1525 


Sequence 


1525, Ap 
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US-10-351-641-1526 


Sequence 


1526, Ap 
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US-10-351-641-1527 


Sequence 1527, Ap 
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US-10-351-641-1528 


Sequence 1528, Ap 
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US-10-351-641-1535 


Sequence 1535, Ap 
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Sequence 1536, Ap 
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3 


35 


12 


US-10-351-641-1687 


Sequence 1687, Ap 


4 04 


4 


14 , 


3 


35 


15 


US-10-158-742A-9 


Sequence 9, Appli 


4 05 


4 


14 . 


3 


36 


8 


US-08-851-965-24 


Sequence 24, Appl 


4 06 


4 


14 . 


3 


36 


9 


US- 09 -864-761-34385 


Sequence 34385, A 




4 


14 . 


3 


36 


9 


US- 09 -864 -761 -37610 


Sequence 37610, A 


4 08 


4 


14 . 


3 


36 


9 


US- 09 -864 -761-4 0701 


Sequence 40701, A 


409 


4 


14 . 


3 


36 


9 


US- 09 -864 -761-47597 


Sequence 47597, A 


410 


4 


14 . 


3 


36 


9 


US- 09 -864 -761 -48797 


Sequence 48797, A 


411 


4 


14 . 


3 


36 


10 


US-09-454-533-30 


Sequence 30, Appl 


412 


4 


14 . 


3 


36 


11 


US-09-813-153-261 


Sequence 261, App 


4 13 


4 


14 . 


3 


36 


12 


US-10-299-003-53 


Sequence 53, Appl 


4 14 


4 


14 , 


3 


36 


12 


US-10-234-816-100 


Sequence 100, App 


415 


4 


14 , 


. 3 


36 


12 


US- 10-029-386-27707 


Sequence 27707, A 


4 16 


4 


14 . 


.3 


36 


12 


US-10-029-386-31370 


Sequence 31370, A 


417 


4 


14 , 


, 3 


36 


14 


US-10-016-969-1 


Sequence 1, Appli 


418 


4 


14 . 


. 3 


36 


14 


US-10-016-157A-248 


Sequence 248, App 


419 


4 


14 . 


.3 


36 


15 


US-10-124-903-17 


Sequence 17, Appl 


420 


4 


14 , 


. 3 


37 


8 


US-08-851-965-22 


Sequence 22, Appl 


421 


4 


14 , 


. 3 


37 


8 


US-08-851-965-23 


Sequence 23, Appl 


422 


4 


14 , 


. 3 


37 


8 


US-08-851-965-25 


Sequence 25, Appl 


423 


4 


14 , 


. 3 


37 


8 


US-08-851-965-26 


Sequence 26, Appl 


424 


4 


14 , 


. 3 


37 


8 


US-08-851-965-27 


Sequence 27, Appl 


425 


4 


14 


.3 


37 


9 


US-09-812-528-15 


Sequence 15, Appl 


426 


4 


14 


. 3 


37 


10 


US-09-764-877-1580 


Sequence 1580, Ap 


427 


4 


14 


. 3 


37 


10 


US-09-908-805B-79 


Sequence 79, Appl 


428 


4 


14 


. 3 


37 


10 


US-09-454-533-6 


Sequence 6, Appli 


429 


4 


14 


. 3 


37 


10 


US-09-454-533-28 


Sequence 28, Appl 


430 


4 


14 


. 3 


37 


10 


US-09-454-533-29 


Sequence 29, Appl 


431 


4 


14 


. 3 


37 


10 


US-09-454-533-31 


Sequence 31, Appl 


432 


4 


14 


. 3 


37 


10 


US-09-454-533-32 


Sequence 32, Appl 


433 


4 


14 


. 3 


37 


10 


US-09-454-533-33 


Sequence 33, Appl 


434 


4 


14 


.3 


37 


11 


US-09-855-698-6 


Sequence 6, Appli 


435 


4 


14 


.3 


37 


12 


US-10-160-162-195 


Sequence 195, App 


436 


4 


14 


. 3 


37 


12 


US-10-029-386-28688 


Sequence 28688, A 


437 


4 


14 


. 3 


37 


12 


US-09-820-649-195 


Sequence 195, App 


438 


4 


14 


. 3 


37 


15 


US-10-000-256A-160 


Sequence 160, App 


439 


4 


14 


. 3 


38 


9 


US- 09 -864 -76 1-39905 


Sequence 3 9905, A 


440 


4 


14 


. 3 


38 


9 


US -09 -8 64 -761 -43 123 


Sequence 43123, A 


441 


4 


14 


. 3 


38 


12 


US-10-289-228-48 


Sequence 48, Appl 


442 


4 


14 


. 3 


.38 


12 


US-10-289-228-50 


Sequence 50, Appl 


443 


4 


14 


. 3 


38 


12 


US -10 -373 -592 -4 8 


Sequence 48, Appl 


444 


4 


14 


. 3 


38 


12 


US-10-373-592-50 


Sequence 50, Appl 


445 


4 


14 


.3 


38 


12 


US-10-373-592-69 


Sequence 69, Appl 


446 


4 


14 


- 3 


38 


12 


US-10-373-592-74 


Sequence 74 , Appl 


447 


4 


14 


. 3 


38 


12 


US- 10 -264 -049 -32 08 


Sequence 3208, Ap 


448 


4 


14 


.3 


38 


15 


US-10-050-704-201 


Sequence 201, App 


449 


4 


14 


.3 


38 


15 


US-10-197-954-112 


Sequence 112, App 


450 


4 


14 


.3 


39 


9 


US-09-730-504-4 


Sequence 4, Appli 


451 


4 


14 


.3 


39 


9 


US-09-864-761-42664 


Sequence 42664, A 


452 


4 


14 


.3 


39 


11 


US-09-809-391-644 


Sequence 644, App 


453 


4 


14 


.3 


39 


12 


US-09-882-171-644 


Sequence 644, App 
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14 


.3 


39 


12 


US-10-105-232-275 


Sequence 275, App 


455 
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.3 


39 


12 


US-10-189-437-262 


Sequence 262, App 
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Sequence 


504, App 
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Sequence 


869, App 
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761-38727 


Sequence 


38727, A 
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40 
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864- 


761-41416 


Sequence 


41416, A 
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3 


40 


9 
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09- 


864- 


761-46448 


Sequence 


46448, A 
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40 


10 
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-09 


-989 


-919-87 


Sequence 


87, Appl 
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40 


12 
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-10 


-395 


-897-11 


Sequence 


11, Appl 
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14 
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-105 


-992-11 


Sequence 


11, Appl 
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40 
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-078 


-090-190 


Sequence 


190, App 
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424- 


550B-484 


Sequence 


484, App 


466 
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28 


8 
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08- 
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550B-554 


Sequence 


554, App 


467 
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28 
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216- 


393-283 


Sequence 


283, App 


468 
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28 
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09- 


752- 


724-1 


Sequence 


1, Appli 


469 


3 


10 . 


7 


28 


9 
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09- 


752- 


724-2 


Sequence 


2, Appli 


470 


3 


10 . 


7 


28 


9 


us- 


09- 


799- 


983-15 


Sequence 


15, Appl 


471 


3 


10 . 


7 


28 


9 


us- 


09- 


870- 


162A-31 


Sequence 


31, Appl 


472 


3 


10 . 


7 


28 


9 


us- 


09- 


864- 


761-33674 


Sequence 


33674, A 


473 
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10 , 


7 


28 


9 
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09- 


864- 


761-33837 


Sequence 


33837, A 


474 


3 
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7 


28 


9 
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09- 


864- 


761-35107 


Sequence 


35107, A 


475 
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28 


9 
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09- 


864- 


761-35251 


Sequence 


35251, A 


476 
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28 


9 


us- 


09- 


864- 


761-36085 


Sequence 


36085, A 


477 
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28 
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864- 


761-36747 


Sequence 


36747, A 


478 
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864- 


761-37845 


Sequence 


37845, A 


479 
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864- 


761-38339 


Sequence 


38339, A 


480 
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9 
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761-39217 


Sequence 


39217, A 


481 
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28 
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-864- 


-761-39322 


Sequence 


39322, A 


482 
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7 


28 
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-864- 


-761-39579 


Sequence 


39579, A 


483 
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-864- 


-761-40087 


Sequence 


40087, A 
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28 
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-864- 


-761-40325 


Sequence 


40325, A 


485 
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28 
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■864- 


-761-40453 


Sequence 


40453, A 


486 
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. 7 


28 


9 
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■09- 


-864- 


-761-40836 


Sequence 


40836, A 
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3 


10 
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28 


9 
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-864- 


-761-41680 


Sequence 


41680, A 


488 
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28 
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-09- 


-864- 


-761-41850 


Sequence 


41850, A 


489 
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28 


9 
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-864- 


-761-42080 


Sequence 


42080, A 
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28 


9 
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-864- 


-761-42292 


Sequence 


42292, A 


491 
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28 


9 
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-09- 


-864- 


-761-43270 


Sequence 


43270, A 


492 
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. 7 


28 


9 


us- 


-09- 


-864- 


-761-43683 


Sequence 


43683, A 
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10 


. 7 


28 


9 


us- 


-09- 


-864- 


-761-43904 


Sequence 


43904, A 


494 


3 


10 


. 7 


28 


9 


us- 


-09- 


-864- 


-761-45426 


Sequence 


45426, A 


495 


3 


10 


. 7 


28 


9 


us- 


-09- 


-864- 


-761-45563 


Sequence 


45563, A 


496 


3 


10 


, 7 


28 


9 


us- 


-09- 


-864- 


-761-45594 


Sequence 


45594, A 


497 


3 


10 


. 7 


28 


9 


us- 


-09- 


-864- 


-761-46717 


Sequence 


46717, A 


498 


3 


10 


. 7 


28 


9 


us- 


-09- 


-864- 


-761-47128 


Sequence 


47128, A 


*! J J 


3 


10 


. 7 


28 


9 


US- 


-09- 


-864- 


-761-47458 


Sequence 


47458, A 


500 


3 


10 


. 7 


28 


9 


us- 


-09- 


-864 


-761-47465 


Sequence 


47465, A 


501 


3 


10 


. 7 


28 


9 


us- 


-09- 


-864 


-761-47891 


Sequence 


47891, A 


502 


3 


10 


. 7 


28 


9 


us- 


-09 


-864 


-761-48105 


Sequence 


48105, A 


503 


3 


10 


. 7 


28 


9 


us- 


-09 


-925 


-299-1023 


Sequence 


1023, Ap 


504 


3 


10 


. 7 


28 


9 


us- 


-09 


-728 


-721-15 


Sequence 


15, Appl 


505 


3 


10 


.7 


28 


9 


us- 


-09 


-728 


-721-19 


Sequence 


19, Appl 


506 


3 


10 


.7 


28 


9 


us- 


-09 


-728 


-721-20 


Sequence 


20, Appl 


507 


3 


10 


. 7 


28 


9 


us- 


-09 


-728 


-721-21 


Sequence 


21, Appl 


508 


3 


10 


.7 


28 


9 


us- 


-09 


-782 


-980-162 


Sequence 


162, App 


509 


3 


10 


, 7 


28 


9 


us- 


-09 


-782 


-980-166 


Sequence 


166, App 


510 


3 


10 


. 7 


28 


9 


us 


-09 


-924 


-889-2 


Sequence 


2, Appli 


511 


3 


10 


. 7 


28 


9 


us 


-09 


-929 


-818-127 


Sequence 


127, App 


512 


3 


10 


.7 


28 


9 


us 


-09 


-929 


-818-154 


Sequence 


154, App 



513 


3 


10. 


7 


28 


10 


US-09-935-291A-17 


Sequence 17, Appl 


514 


3 


10. 


7 


28 


10 


US-09-935-291A-19 


Sequence 19, Appl 


515 


3 


10. 


7 


28 


10 


US-09-982-172-128 


Sequence 128, App 


516 


3 


10. 


7 


28 


10 


US-09-982-172-226 


Sequence 226, App 


517 


3 


10 . 


7 


28 


10 


US-09-903-456-116 


Sequence 116, App 


518 


3 


10. 


7 


28 


10 


US-09-921-397-13 


Sequence 13, Appl 


519 


3 


10 . 


7 


28 


10 


US-09-934-060A-20 


Sequence 20, Appl 


520 


3 


10. 


7 


28 


10 


US-09-852-424-76 


Sequence 76, Appl 


521 


3 


10 . 


7 


28 


10 


US-09-852-424-86 


Sequence 86, Appl 


522 


3 


10, 


7 


28 


10 


US-09-852-424-87 


Sequence 87, Appl 


523 


3 


10. 


7 


28 


10 


US-09-852-424-88 


Sequence 88, Appl 


524 


3 


10. 


7 


28 


10 


US-09-852-424-89 


Sequence 89, Appl 


525 


3 


10 . 


7 


28 


10 


US-09-852-424-102 


Sequence 102, App 


526 


3 


10. 


7 


28 


10 


US-09-852-424-103 


Sequence 103, App 


527 


3 


10- 


7 


28 


10 


US-09-852-424-104 


Sequence 104, App 


528 


3 


10. 


7 


28 


10 


US-09-852-424-105 


Sequence 105, App 


529 


3 


10. 


7 


28 


10 


US-09-999-745-34 


Sequence 34, Appl 


530 


3 


10. 


7 


28 


10 


US-09-843-676-115 


Sequence 115, App 


531 


3 


10. 


7 


28 


10 


US-09-554-000-18 


Sequence 18, Appl 


532 


3 


10 . 


7 


28 


10 


US-09-984-245-225 


Sequence 225, App 


533 


3 


10. 


7 


28 


10 


US-09-908-153B-49 


Sequence 49, Appl 


534 


3 


10 . 


,7 


28 


10 


US-09-908-153B-52 


Sequence 52, Appl 


535 


3 


10. 


,7 


28 


10 


US-09-785-059-1 


Sequence 1, Appli 


536 


3 


10. 


.7 


28 


10 


US-09-766-253-115 


Sequence 115, App 


537 


3 


10. 


.7 


28 


11 


US-09-798-889-151 


Sequence 151, App 


538 


3 


10. 


.7 


28 


11 


US-09-438-486-115 


Sequence 115, App 


539 


3 


10 , 


,1 


28 


11 


US-09-986-480-214 


Sequence 214, App 


540 


3 


10, 


.7 


28 


11 


US-09-974-879-310 


Sequence 310, App 


541 


3 


10 


.7 


28 


11 


US-09-974-879-543 


Sequence 543, App 


542 


3 


10 


.7 


28 


11 


US-09-974-879-546 


Sequence 546, App 


543 


3 


10 


.7 


28 


11 


US-09-974-879-573 


Sequence 573, App 


544 


3 


10 


.7 


28 


11 


US-09-974-879-579 


Sequence 579, App 


545 


3 


10 


.7 


28 


11 


US-09-785-058-1 


Sequence 1, Appli 


546 


3 


10 


.7 


28 


11 


US-09-925-299-1023 


Sequence 1023, Ap 


547 


3 


10 


.7 


28 


11 


US-09-776-724A-198 


Sequence 198, App 


548 


3 


10 


.7 


28 


11 


US-09-966-262-225 


Sequence 225, App 


549 


3 


10 


.7 


28 


11 


US-09-983-966-225 


Sequence 225, App 


550 


3 


10 


.7 


28 


11 


US-09-911-569-81 


Sequence 81, Appl 


551 


3 


10 


.7 


28 


11 


US-09-305-736-310 


Sequence 310, App 


552 


3 


10 


.7 


28 


11 


US-09-305-736-545 


Sequence 545, App 


553 


3 


10 


.7 


28 


11 


US-09-305-736-548 


Sequence 548, App 


554 


3 


10 


.7 


28 


11 


US-09-305-736-574 


Sequence 574, App 


555 


3 


10 


.7 


28 


11 


US-09-305-736-580 


Sequence 580, App 


556 


3 


10 


, 7 


28 


11 


US-09-897-776A-20 


Sequence 20, Appl 


557 


3 


10 


. 7 


28 


12 


US-09-911-261A-13 


Sequence 13, Appl 


558 


3 


10 


, 7 


28 


12 


US-09-911-261A-15 


Sequence 15, Appl 


559 


3 


10 


.7 


28 


12 


US-09-911-261A-16 


Sequence 16, Appl 


560 


3 


10 


.7 


28 


12 


US-09-911-261A-17 


Sequence 17, Appl 


561 


3 


10 


.7 


28 


12 


US-09-911-261A-68 


Sequence 68, Appl 


562 


3 


10 


.7 


28 


12 


US-09-911-261A-69 


Sequence 69, Appl 


563 


3 


10 


.7 


28 


12 


US-10-200-879-81 


Sequence 81, Appl 


564 


3 


10 


. 7 


28 


12 


US-10-334-405-4 


Sequence 4, Appli 


565 


3 


10 


.7 


28 


12 


US-10-334-405-5 


Sequence 5, Appli 


566 


3 


10 


.7 


28 


12 


US-10-156-911-116 


Sequence 116, App 


567 


3 


10 


.7 


28 


12 


US-10-315-515-60 


Sequence 60, Appl 


568 


3 


10 


. 7 


28 


12 


US-10-315-515-61 


Sequence 61, Appl 


569 


3 


10 


.7 


28 


12 


US-10-315-515-62 


Sequence 62, Appl 



570 


3 


10 


7 


28 


12 


US- 


10 


-315 


-515-70 


Sequence 


70, Appl 


571 


3 


10 


7 


28 


12 


US- 


09 


-674 


-913A-7 


Sequence 


7, Appli 


572 


3 


10 


7 


28 


12 


US- 


10 


-061 


-607A-45 


Sequence 


45, Appl 


573 


3 


10 
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28 


12 


US- 


10 


-279 


-061-1 


Sequence 


1, Appli 


574 
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7 


28 


12 


US- 


10 


-279 


-061-76 


Sequence 


76, Appl 
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10 


-082 


-828A-224 


Sequence 


224, App 
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10 


-082 


-828A-260 


Sequence 


260, App 
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12 
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10 


-231 


-417-416 


Sequence 


416, App 


578 
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12 
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10 


-314 


-506-9 


Sequence 


9, Appli 


579 


3 


10 


7 
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12 
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10 


-408 


-736-81 


Sequence 


81, Appl 
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28 


12 
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09 


-933 


-767-638 


Sequence 


638, App 
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3 


10 


7 


28 


12 


US- 


09 


-933 


-767-714 


Sequence 


714, App 


582 


3 


10 


7 


28 


12 


US- 


10 


-351 


-641-68 


Sequence 


68, Appl 


583 


3 


10 


7 


28 


12 


US- 


10 


-351 


-641-1068 


Sequence 


1068, Ap 
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7 


28 
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10 


-029 


-386-27629 


Sequence 


27629, A 
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28 


12 


US- 


10 


-029 


-386-27826 


Sequence 


27826, A 


586 
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10 


7 


28 


12 


US- 


10 


-029 


-386-28083 


Sequence 


28083, A 


587 
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10 


7 


28 


12 


US- 


10 


-029 


-386-28127 


Sequence 


28127, A 


588 
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7 


28 


12 
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10 


-029 


-386-28289 


Sequence 


28289, A 
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12 
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10 


-029 


-386-30101 


Sequence 


30101, A 
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12 


US- 


10 


-029 


-386-31055 


Sequence 


31055, A 
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7 
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12 
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10 


-029 


-386-31138 


Sequence 


31138, A 


592 
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12 
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10 


-029 


-386-31429 


Sequence 


31429, A 
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12 
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10 


-029 


-386-31503 


Sequence 


31503, A 
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28 


12 
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10 


-029 


-386-31870 


Sequence 


31870, A 
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. 7 
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12 
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10 


-029 


-386-32354 


Sequence 


32354, A 
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12 
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10 


-062 


-599-173 


Sequence 


173, App 


597 


3 


10 


7 


28 


12 


US- 


10 


-121 


-799-2 


Sequence 


2 , Appl i 


598 


3 


10 


. 7 


28 


12 


us- 


10 


-189 


-437-569 


Sequence 


569, App 


599 


3 


10 


7 


28 


12 


US- 


10 


-321 


-856-283 


Sequence 


283, App 


600 


3 


10 


. 7 


28 


12 


us- 


10 


-404 


-626-2 


Sequence 


2, Appli 


601 


3 


10 


7 


28 


12 


us- 


10 


-080 


-254-100 


Sequence 


100, App 


602 


3 


10 


. 7 


28 


12 


US- 


09 


-818 


-683-310 


Sequence 


310, App 


603 


3 


10 


7 


28 


12 


us- 


09 


-818 


-683-545 


Sequence 


545, App 


604 


3 


10 


7 


28 


12 


us- 


09 


-818 


-683-548 


Sequence 


548, App 


605 


3 


10 


7 


28 


12 


us- 


09 


-818 


-683-574 


Sequence 


574, App 


606 


3 


10 


7 


28 


12 


us- 


09 


-818 


-683-580 


Sequence 


580, App 


607 


3 


10 


7 


28 


12 


us- 


10 


-366 


-125-21 


Sequence 


21, Appl 


608 


3 


10 


7 


28 


12 


us- 


10 


-418 


-552-10 


Sequence 


10, Appl 


609 


3 


10 


7 


28 


12 


US- 


10 


-418 


-552-12 


Sequence 


12, Appl 


610 


3 


10 


7 


28 


12 


us- 


10 


-418 


-552-17 


Sequence 


17, Appl 


611 


3 


10 


7 


28 


12 


us- 


10 


-242 


-355-646 


Sequence 


646, App 


612 


3 


10 


7 


28 


12 


us- 


10 


-154 


-884B-11082 


Sequence 


11082, A 


613 


3 


10 


7 


28 


12 


us- 


10 


-258 


-367-6 


Sequence 


6, Appli 


614 


3 


10 


7 


28 


12 


us- 


10 


-264 


-049-3158 


Sequence 


3158, Ap 


615 


3 


10 


7 


28 


14 


us- 


10 


-001 


-879-127 


Sequence 


127, App 


616 


3 


10 


7 


28 


14 


us- 


10 


-001 


-879-148 


Sequence 


148, App 


617 


3 


10 


7 


28 


14 


us- 


10 


-014 


-269-9 


Sequence 


9, Appli 


618 


3 


10 


7 


28 


14 


US- 


10 


-156 


-820-35 


Sequence 


35, Appl 


619 


3 


10 


7 


28 


14 


us- 


10 


-105 


-931-15 


Sequence 


15, Appl 


620 


3 


10 


7 


28 


14 


us- 


10 


-105 


-931-19 


Sequence 


19, Appl 


621 


3 


10 


7 


28 


14 


us- 


10 


-105 


-931-20 


Sequence 


20, Appl 


622 


3 


10 


7 


28 


14 


us- 


10 


-105 


-931-21 


Sequence 


21, Appl 


623 


3 


10 


7 


28 


14 


us- 


10 


-079 


-075-1 


Sequence 


1, Appli 


624 


3 


10 


7 


28 


14 


us- 


10 


-002 


-974-9 


Sequence 


9, Appli 


625 


3 


10 


7 


28 


14 


us- 


10 


-118 


-984-15 


Sequence 


15, Appl 


626 


3 


10 


7 


28 


14 


us- 


10 


-118 


-984-19 


Sequence 


19, Appl 



627 


3 


10. 


7 


28 


14 


US-10-118-984-20 


Sequence 20, Appl 


628 


3 


10. 


7 


28 


14 


US-10-118-984-21 


Sequence 21, Appl 


629 


3 


10. 


7 


28 


15 


US-10-060-102-3 


Sequence 3, Appli 


630 


3 


10. 


7 


28 


15 


US-10-053-758-115 


Sequence 115, App 


631 


3 


10. 


7 


28 


15 


US-10-092-750-18 


Sequence 18, Appl 


632 


3 


10. 


7 


28 


15 


US-10-092-750-71 


Sequence 71, Appl 


633 


3 


10. 


7 


28 


15 


US-10-054-295-115 


Sequence 115, App 


634 


3 


10. 


7 


28 


15 


US-10-055-713-11 


Sequence 11, Appl 


635 


3 


10. 


7 


28 


15 


US-10-055-713-13 


Sequence 13, Appl 


636 


3 


10. 


7 


28 


15 


US-10-055-713-18 


Sequence 18, Appl 


637 


3 


10. 


7 


28 


15 


US-10-050-704-280 


Sequence 280, App 


638 


3 


10. 


7 


28 


15 


US-10-043-487-284 


Sequence 284, App 


639 


3 


10. 


7 


28 


15 


US-10-097-065-307 


Sequence 307, App 


640 


3 


10. 


7 


28 


15 


US-10-097-065-331 


Sequence 331, App 


641 


3 


10. 


7 


28 


15 


US-10-097-065-351 


Sequence 351, App 


642 


3 


10. 


7 


28 


15 


US-10-097-065-478 


Sequence 478, App 


643 


3 


10. 


7 


28 


15 


US-10-054-611-115 


Sequence 115, App 


644 


3 


10. 


7 


28 


15 


US-10-106-806-5 


Sequence 5, Appli 


645 


3 


10. 


7 


28 


15 


US-10-143-090-225 


Sequence 225, App 


646 


3 


10. 


7 


28 


15 


US-10-059-261-324 


Sequence 324, App 


647 


3 


10, 


7 


28 


15 


US-10-150-111-166 


Sequence 166, App 


648 


3 


10, 


7 


28 


15 


US-10-057-408-13 


Sequence 13, Appl 


649 


3 


10. 


,7 


28 


15 


US-10-057-408-15 


Sequence 15, Appl 


650 


3 


10- 


.1 


28 


15 


US-10-057-408-16 


Sequence 16, Appl 


651 


3 


10. 


,7 


28 


15 


US-10-057-408-17 


Sequence 17, Appl 


652 


3 


10. 


,1 


28 


15 


US-10-057-408-68 


Sequence 68, Appl 


653 


3 


10. 


.7 


28 


15 


US-10-057-408-69 


Sequence 69, Appl 


654 


3 


10. 


.7 


28 


15 


US-10-247-279-9 


Sequence 9, Appli 


655 


3 


10, 


. 7 


28 


15 


US-10-023-282-638 


Sequence 63 8, App 


656 


3 


10, 


.7 


28 


15 


US-10-023-282-714 


Sequence 714, App 


657 


3 


10 


.7 


28 


15 


US-10-203-942-5 


Sequence 5, Appli 


658 


3 


10 


.7 


28 


15 


US-10-062-831-173 


Sequence 173, App 


659 


3 


10 


.7 


28 


15 


US-10-055-711-19 


Sequence 19, Appl 


660 


3 


10 


.7 


28 


15 


US-10-055-711-21 


Sequence 21, Appl 


661 


3 


10 


.7 


28 


15 


US-10-055-711-26 


Sequence 26, Appl 


662 


3 


10 


.7 


28 


15 


US-10-197-954-14 


Sequence 14, Appl 


663 


3 


10 


.7 


28 


15 


US-10-288-220-8 


Sequence 8, Appli 


664 


3 


10 


.7 


28 


15 


US-10-283-500-17 


Sequence 17, Appl 


665 


3 


10 


.7 


28 


15 


US-10-295-981-15 


Sequence 15, Appl 


666 


3 


10 


.7 


28 


15 


US-10-295-981-19 


Sequence 19, Appl 


667 


3 


10 


.7 


28 


15 


US-10-295-981-20 


Sequence 20, Appl 


668 


3 


10 


.7 


28 


15 


US-10-295-981-21 


Sequence 21, Appl 


669 


3 


10 


. 7 


28 


15 


US-10-216-209-70 


Sequence 70, Appl 


670 


3 


10 


.7 


29 


8 


US-08-910-386A-44 


Sequence 44, Appl 


671 


3 


10 


.7 


29 


8 


US-08-913-430-4 


Sequence 4, Appli 


672 


3 


10 


.7 


29 


9 


US-09-729-835-106 


Sequence 106, App 


673 


3 


10 


.7 


29 


9 


US-09-205-658-71 


Sequence 71, Appl 


674 


3 


10 


.7 


29 


9 


US-09-844-353A-71 


Sequence 71, Appl 


675 


3 


10 


.7 


29 


9 


US-09-864-761-33928 


Sequence 33928, A 


676 


3 


10 


.7 


29 


9 


US-09-864-761-33989 


Sequence 33989, A 


677 


3 


10 


.7 


29 


9 


US-09-864-761-36241 


Sequence 36241, A 


678 


3 


10 


.7 


29 


9 


US-09-864-761-38088 


Sequence 38088, A 


679 


3 


10 


.7 


29 


9 


US- 09 -8 64 -761 -38476 


Sequence 38476, A 


680 


3 


10 


.7 


29 


9 


US-09-864-761-39320 


Sequence 3932 0, A 


681 


3 


10 


.7 


29 


9 


US-09-864-761-39413 


Sequence 3 9413, A 


682 


3 


10 


.7 


29 


9 


US-09-864-761-40217 


Sequence 4 0217, A 


683 


3 


10 


.7 


29 


9 


US-09-864-761-40403 


Sequence 4 0403, A 



684 


3 


10 . 


7 


29 


9 


US-09-864-761-40672 


Sequence 4 0672, A 


685 


3 


10 . 


7 


29 


9 


US-09-864-761-40727 


Sequence 4 0727, A 


686 


3 


10 . 


7 


29 


9 


US-09-864-761-40839 


Sequence 4 083 9, A 


687 


3 


10 . 


7 


29 


9 


US-09-864-761-41049 


Sequence 41049, A 


688 


3 


10 . 


7 


29 


9 


US-09-864-761-41733 


Sequence 41733, A 


689 


3 


10- 


7 


29 


9 


US-09-864-761-41808 


Sequence 41808, A 


690 


3 


10 . 


7 


29 


9 


US-09-864-761-42014 


Sequence 42014, A 


691 


3 


10 . 


7 


29 


9 


US-09-864-761-43866 


Sequence 43866, A 




3 


10 . 


7 


29 


9 


US-09 -864 -761-44607 


Sequence 44607, A 


O V J 


3 


10 . 


7 


29 


9 


US -09 -864 -761 -44706 


Sequence 44706, A 


694 


3 


10 . 


7 


29 


9 


US-09 -864 -761-44778 


Sequence 44778, A 


695 


3 


10 . 


7 


29 


9 


US-09-864-761-45581 


Sequence 45581, A 


696 


3 


10 . 


7 


29 


9 


US-09 -864 -761-45948 


Sequence 45948, A 


6 97 


3 


10 . 


7 


29 


9 


US -09 -864 -761-4613 0 


Sequence 4613 0, A 


698 


3 


10. 


7 


29 


9 


US- 09 -8 64 -761-4 6874 


Sequence 46874, A 


699 


3 


10 . 


7 


29 


9 


US-09-925-299-1222 


Sequence 1222, Ap 


700 


3 


10 . 


7 


29 


9 


US-09-764-869-674 


Sequence 674, App 


701 


3 


10 . 


7 


29 


9 


US-09-851-271A-9 


Sequence 9, Appli 


702 


3 


10 , 


7 


29 


9 


US-09-782-980-134 


Sequence 134, App 


703 


3 


10 . 


7 


29 


9 


US-09-782-980-135 


Sequence 135, App 


704 


3 


10 . 


7 


29 


9 


US-09-789-836-13 


Sequence 13, Appl 


705 


3 


10 . 


, 1 


29 


10 


US-09-922-261-435 


Sequence 435, App 


706 


3 


10 . 


. 7 


29 


10 


US-09-764-877-1596 


Sequence 1596, Ap 


707 


3 


10 . 


, 7 


29 


10 


US-09-925-300-994 


Sequence 994, App 


708 


3 


10 . 


, 7 


29 


10 


US-09-779-050A-25 


Sequence 25, Appl 


709 


3 


10 . 


. 7 


29 


10 


US-09-749-637A-84 


Sequence 84, Appl 


710 


3 


10 . 


. 7 


29 


11 


US-09-774-639-293 


Sequence 293, App 


711 


3 


10 . 


. 7 


29 


11 


US-09-983-802-184 


Sequence 184, App 


712 


3 


10 


. 7 


29 


11 


US-09-956-940-38 


Sequence 38, Appl 


713 


3 


10 


. 7 


29 


11 


US-09-995-973-9 


Sequence 9, Appli 


714 


3 


10 


. 7 


29 


11 


US-09-272-975-56 


Sequence 56, Appl 


715 


3 


10 


. 7 


29 


11 


US-09-974-879-308 


Sequence 3 08, App 


716 


3 


10 


. 7 


29 


11 


US-09-974-879-580 


Sequence 580, App 


717 


3 


10 


. 7 


29 


11 


US-09-925-299-1222 


Sequence 1222, Ap 


718 


3 


10 


. 7 


29 


11 


US-09-809-391-384 


Sequence 384, App 


719 


3 


10 


, 7 


29 


11 


US-09-809-391-388 


Sequence 338, App 


720 


3 


10 


. 7 


29 


11 


US-09-809-391-743 


Sequence 743, App 


721 


3 


10 


. 7 


29 


11 


US-09-969-730-269 


Sequence 269, App 


722 


3 


10 


. 7 


29 


11 


US-09-281-495-47 


Sequence 47, Appl 


723 


3 


10 


, 7 


29 


11 


US-09-764-891-2873 


Sequence 2873, Ap 


724 


3 


10 


. 7 


29 


11 


US-09-899-495-70 


Sequence 70, Appl 


725 


3 


10 


. 7 


29 


11 


US-09-305-736-308 


Sequence 308, App 


726 


3 


10 


. 7 


29 


11 


US-09-305-736-581 


Sequence 581, App 


797 


3 


10 


. 7 


29 


11 


US-09-996-484-9 


Sequence 9, Appli 


728 


3 


10 


. 7 


29 


11 


US-09-908-139-19 


Sequence 19, Appl 


729 


3 


10 


. 7 


29 


11 


US-09-908-139-21 


Sequence 21, Appl 


730 


3 


10 


. 7 


29 


11 


US-09-866-066-23 


Sequence 23, Appl 


73 1 


3 


10 


. 7 


29 


12 


US-10-073-333A-32 


Sequence 32, Appl 


732 


3 


10 


. 7 


29 


12 


US-10-195-730-214 


Sequence 214, App 


733 


3 


10 


.7 


29 


12 


US-10-146-999-15 


Sequence 15, Appl 


734 


3 


10 


.7 


29 


12 


US-10-199-869-87 


Sequence 87, Appl 


735 


3 


10 


.7 


29 


12 


US-10-231-894-44 


Sequence 44, Appl 


736 


3 


10 


.7 


29 


12 


US-10-193-109-19 


Sequence 19, Appl 


737 


3 


10 


.7 


29 


12 


US-10-316-253-57 


Sequence 57, Appl 


738 


3 


10 


.7 


29 


12 


US-09-789-831-13 


Sequence 13 , Appl 


739 


3 


10 


.7 


29 


12 


US-10-131-909A-7 


Sequence 7, Appli 


740 


3 


10 


.7 


29 


12 


US-10-160-162-198 


Sequence 198, App 



741 


3 


10 . 


7 


29 


12 


US- 


10- 


160- 


162-286 


Sequence 


286, App 


742 


3 


10 . 


7 


29 


12 


us- 


10- 


411- 


224-161 


Sequence 


161, App 


743 


3 


10 . 


7 


29 


12 


us- 


09- 


942- 


052-704 


Sequence 


704, App 


744 


3 


10 . 


7 


29 


12 


us- 


09- 


882- 


171-384 


Sequence 


384, App 


745 


3 


10 . 


7 


29 


12 


us- 


09- 


882- 


171-388 


Sequence 


388, App 


746 


3 


10 . 


7 


29 


12 


us- 


09- 


882- 


171-743 


Sequence 


743, App 


747 


3 


10 . 


7 


29 


12 


us- 


10- 


008- 


524A-110 


Sequence 


110, App 


748 


3 


10 . 


7 


29 


12 


us- 


10- 


340- 


458-4 


Sequence 


4, Appli 


74 9 


3 


10 . 


7 


29 


12 


us- 


10- 


340- 


458-21 


Sequence 


21, Appl 


750 


3 


10 . 


7 


29 


12 


us- 


09- 


933- 


767-824 


Sequence 


824, App 


751 


3 


10 . 


7 


29 


12 


us- 


09- 


933- 


767-864 


Sequence 


864, App 


752 


3 


10 . 


7 


29 


12 


us- 


09- 


933- 


767-1182 


Sequence 


1182, Ap 


753 


3 


10 - 


7 


29 


12 


us- 


09- 


933- 


767-1187 


Sequence 


1187, Ap 


754 


3 


10 . 


7 


29 


12 


us- 


09- 


963- 


■693-71 


Sequence 


71, Appl 


755 


3 


10 . 


7 


29 


12 


us- 


■10- 


105- 


232-210 


Sequence 


210, App 


756 


3 


10 . 


7 


29 


12 


us- 


10- 


■105- 


■232-319 


Sequence 


319, App 


757 


3 


10 . 


7 


29 


12 


us- 


10- 


131- 


-686A-20 


Sequence 


20, Appl 


758 


3 


10 . 


7 


29 


12 


us- 


•10- 


■280- 


■066-472 


Sequence 


472, App 


759 


3 


10 . 


7 


29 


12 


us- 


■10- 


-289- 


-135A-25 


Sequence 


25, Appl 


760 


3 


10 . 


7 


29 


12 


us- 


-10- 


-192- 


-078-23 


Sequence 


23, Appl 


761 


3 


10 , 


7 


29 


12 


us- 


■10- 


-263- 


-162-6 


Sequence 


6, Appli 


762 


3 


10 , 


, 7 


29 


12 


us- 


-10- 


-422- 


-934-23 


Sequence 


23, Appl 


763 


3 


10 . 


, 7 


29 


12 


us- 


-10- 


-405- 


-231-38 


Sequence 


38, Appl 


764 


3 


10 . 


. 7 


29 


12 


us- 


-09- 


-962- 


-756-1343 


Sequence 


1343, Ap 


765 


3 


10 . 


. 7 


29 


12 


us- 


-09- 


-962- 


-756-1357 


Sequence 


1357, Ap 


766 


3 


10 , 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-27814 


Sequence 


27814, A 


767 


3 


10 . 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-27898 


Sequence 


27898, A 


768 


3 


10 . 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-28398 


Sequence 


28398, A 


769 


3 


10 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-28427 


Sequence 


28427, A 


770 


3 


10 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-30036 


Sequence 


30036, A 


771 


3 


10 


.7 


29 


12 


us- 


-10- 


-029- 


-386-30088 


Sequence 


30088, A 


772 


3 


10 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-30256 


Sequence 


30256, A 


773 


3 


10 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-30373 


Sequence 


30373, A 


774 


3 


10 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-31330 


Sequence 


31330, A 


775 


3 


10 


. 7 


29 


12 


us- 


-10- 


-029- 


-386-31546 


Sequence 


31546, A 


776 


3 


10 


. 7 


29 


12 


us- 


-10- 


-029 


-386-32650 


Sequence 


32650, A 


777 


3 


10 


, 7 


29 


12 


us- 


-10- 


-029 


-386-32712 


Sequence 


32712, A 


778 


3 


10 


, 7 


29 


12 


us 


-10- 


-029 


-386-32812 


Sequence 


32812, A 


779 


3 


10 


. 7 


29 


12 


us 


-10 


-029 


-386-33620 


Sequence 


33620, A 


780 


3 


10 


. 7 


29 


12 


us 


-10 


-189 


-437-197 


Sequence 


197, App 


781 


3 


10 


. 7 


29 


12 


us 


-10 


-189 


-437-306 


Sequence 


306, App 


782 


3 


10 


.7 


29 


12 


us 


-10 


-189 


-437-693 


Sequence 


693, App 


783 


3 


10 


. 7 


29 


12 


us 


-10 


-189 


-437-707 


Sequence 


707, App 


784 


3 


10 


. 7 


29 


12 


us 


-10 


-380 


-558-12 


Sequence 


12, Appl 


785 


3 


10 


. 7 


29 


12 


us 


-09 


-820 


-649-198 


Sequence 


198, App 


786 


3 


10 


, 7 


29 


12 


us 


-09 


-820 


-649-286 


Sequence 


286, App 


787 


3 


10 


. 7 


29 


12 


us 


-10 


-231 


-889-44 


Sequence 


44, Appl 


788 


3 


10 


. 7 


29 


12 


us 


-09 


-818 


-683-308 


Sequence 


308, App 


789 


3 


10 


. 7 


29 


12 


us 


-09 


-818 


-683-581 


Sequence 


581, App 


790 


3 


10 


. 7 


29 


12 


us 


-10 


-350 


-719-110 


Sequence 


110, App 


791 


3 


10 
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3 


10 . 


7 


30 


12 


US-10-139-794-79 


Sequence 


79, Appl 


973 


3 


10 . 


7 


30 


12 


US-10-411-544-4 


Sequence 


4, Appli 


974 


3 


10 . 


7 


30 


12 


US-10-057-475B-10823 


Sequence 


10823, A 


975 


3 


10 . 


7 


30 


12 


US-10-057-475B-10824 


Sequence 


10824, A 


976 


3 


10 


7 


30 


12 


US-10-154-884B-10823 


Sequence 


10823, A 


977 


3 


10 


7 


30 


12 


US- 10 -154 -8 84B- 10824 


Sequence 


10824, A 


978 


3 


10 


7 


30 


13 


US-10-036-568-5 


Sequence 


5, Appli 


979 


3 


10 


7 


30 


14 


US-10-051-843-5 


Sequence 


5, Appli 


980 


3 


10 


7 


30 


14 


US-10-055-430-4 


Sequence 


4, Appli 


981 


3 


10 


7 


30 


14 


US-10-103-448-4 


Sequence 


4, Appli 


982 


3 


10 


7 


30 


14 


US-10-010-114-18 


Sequence 


18, Appl 


983 


3 


10 


7 


30 


14 


US-10-002-344A-14 6 


Sequence 


146, App 


984 


3 


10 


7 


30 


14 


US-10-115-695-33 


Sequence 


33, Appl 


985 


3 


10 


7 


30 


14 


US-10-078-059-23 


Sequence 


23, Appl 


986 


3 


10 


7 


30 


14 


US-10-108-929-4 


Sequence 


4, Appli 


987 


3 


10 


7 


30 


15 


US-10-006-069A-217 


Sequence 


217, App 


988 


3 


10 


. 7 


30 


15 


US-10-116-561-33 


Sequence 


33, Appl 


989 


3 


10 


. 7 


30 


15 


US-10-115-671-33 


Sequence 


33, Appl 


990 


3 


10 


. 7 


30 


15 


US-10-094-699-18 


Sequence 


18, Appl 


991 


3 


10 


. 7 


30 


15 


US-10-091-572-389 


Sequence 


389, App 


992 


3 


10 


, 7 


30 


15 


US-10-091-244A-12 


Sequence 


12, Appl 


993 


3 


10 


.7 


30 


15 


US-10-143-090-268 


Sequence 


268, App 


994 


3 


10 


.7 


30 


15 


US-10-227-629-14 


Sequence 


14, Appl 


995 


3 


10 


,7 


30 


15 


US-10-051-841-5 


Sequence 


5, Appli 


996 


3 


10 


,7 


30 


15 


US-10-059-261-55 


Sequence 


55, Appl 


997 


3 


10 


.7 


30 


15 


US-10-059-261-163 


Sequence 


163, App 


998 


3 


10 


.7 


30 


15 


US-10-115-415-33 


Sequence 


33, Appl 


999 


3 


10 


.7 


30 


15 


US-10-116-260-33 


Sequence 


33, Appl 


1000 


3 


10 


.7 


30 


15 


US-10-165-420-5 


Sequence 


5, Appli 



ALIGNMENTS 



RESULT 1 

US-09-843-221A-69 

; Sequence 69, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 22 lA 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2 001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

: PRIOR FILING DATE: 2000-04-27 



; NUMBER OF SEQ ID NOS : 170 

; SOFTWARE: Patentin version 3.1 

; SEQ ID NO 69 

LENGTH: 28 

TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-69 

Query Match 100.0%; Score 28; DB 11; Length 28; 

Best Local Similarity 100.0%; Pred. No. 4e-22; 

Matches 28; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 1 LLHNLGKSIQDLRRRFFLHHLIAEIHTA 28 

Illlllllllllllllllllllllllll 

Db 1 LLHNLGKSIQDLRRRFFLHHLIAEIHTA 28 



RESULT 2 

US-09-843-221A-169 

; Sequence 169, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE- 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 22 lA 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS: 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 169 

LENGTH: 28 

TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Preferred embodiments - PTHrP 
NAME/KEY: misc_feature 
LOCATION: (28) . - (28) 

OTHER INFORMATION: Optional linker and Fc domain attached at the C- terminus 
US-09-843-221A-169 

Query Match 100.0%; Score 28; DB 11; Length 28; 

Best Local Similarity 100.0%; Pred. No. 4e-22; 

Matches 28; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



1 LLHNLGKSIQDLRRRFFLHHLIAEIHTA 28 



Illlllllllllllllllllllllllll 

Db 1 LLHNLGKSIQDLRRRFFLHHLIAEIHTA 28 



RESULT 3 

US-09-843-221A-68 

; Sequence 68, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE- 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 221A 

; CURRENT FILING DATE: 2 001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patentin version 3.1 
; SEQ ID NO 68 

LENGTH: 28 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-843-221A-68 

Query Match 82.1%; Score 23; DB 11; Length 28; 

Best Local Similarity 100.0%; Pred. No. 5.9e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIIIIIIMIIIIIIIMIII 

Db 6 GKSIQDLRRRFFLHHLIAEIHTA 2 8 



RESULT 4 
US-09-169-786-5 

; Sequence 5, Application US/09169786B 

; Patent No. US20020025929A1 

; GENERAL INFORMATION: 

; APPLICANT: Sato, Masahiko 

; TITLE OF INVENTION: METHOD OF BUILDING AND MAINTAINING BONE 
; FILE REFERENCE: X- 114 80 

; CURRENT APPLICATION NUMBER: US/09/ 169 , 786B 

CURRENT FILING DATE: 1998-10-09 
; EARLIER APPLICATION NUMBER: US 60/061,800 
; EARLIER FILING DATE: 1997-10-14 
; NUMBER OF SEQ ID NOS: 12 
; SOFTWARE: Patentin Ver. 2.0 
; SEQ ID NO 5 
LENGTH: 34 



TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-169-786-5 

Query Match 82.1%; Score 23; DB 9; Length 34; 

Best Local Similarity 100.0%; Fred. No. 7e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIMIMIIIIIIIIMIMIII 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 5 

US-09-423-800-75 

; Sequence 75, Application US/09423800 

; Patent No. US2a020165363Al 

; GENERAL INFORMATION: 

; APPLICANT: SATO, KOH 

; APPLICANT: TSUNENARI , TOSHIAKI 

; APPLICANT: ISHII, KIMIE 

; TITLE OF INVENTION: CACHEXIA REMEDY 

; FILE REFERENCE: 04853-0036 

; CURRENT APPLICATION NUMBER: US/09/423 , 800 

; CURRENT FILING DATE: 1999-11-12 

; PRIOR APPLICATION NUMBER: PCT/JP98/02116 

; PRIOR FILING DATE: 1998-05-13 

; PRIOR APPLICATION NUMBER: JP 125505/1997 

; PRIOR FILING DATE: 1997-05-15 

; PRIOR APPLICATION NUMBER: JP 194445/1997 

; PRIOR FILING DATE: 1997-07-18 

; NUMBER OF SEQ ID NOS : 87 

; SOFTWARE: Patentin Ver. 2.1 

; SEQ ID NO 75 

LENGTH: 34 

TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-423-800-75 

Query Match 82.1%; Score 23; DB 10; Length 34; 

Best Local Similarity 100.0%; Pred. No, 7e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIIIIIIIIIIMIIIIIIII 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 6 

US-09-843-221A-63 

; Sequence 63, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 



; TITLE OF INVENTION: MODUIATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843,221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS: 170 

SOFTWARE: PatentIn version 3.1 
; SEQ ID NO 63 

LENGTH: 34 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-843-221A-63 

Query Match 82.1%; Score 23; DB 11; Length 34; 

Best Local Similarity 100.0%; Pred. No. 7e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIMIIIIIIIIIMIIIIIII 
Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 7 

US-10-337-981-75 

; Sequence 75, Application US/10337981 

; Publication No. US20030138424A1 

; GENERAL INFORMATION: 

; APPLICANT: SATO, KOH 

; APPLICANT: TSUNENARI , TOSHIAKI 

; APPLICANT: ISHII, KIMIE 

; TITLE OF INVENTION: CACHEXIA REMEDY 

; FILE REFERENCE: 04853-0036 

; CURRENT APPLICATION NUMBER: US/10/337,981 

; CURRENT FILING DATE: 2003-01-08 

; PRIOR APPLICATION NUMBER: PCT/JP98/02 116 

; PRIOR FILING DATE: 1998-05-13 

; PRIOR APPLICATION NUMBER: JP 125505/1997 

; PRIOR FILING DATE: 1997-05-15 

; PRIOR APPLICATION NUMBER: JP 194445/1997 

; PRIOR FILING DATE: 1997-07-18 

; NUMBER OF SEQ ID NOS: 87 

; SOFTWARE: PatentIn Ver. 2.1 

; SEQ ID NO 75 

LENGTH: 34 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-337-981-75 

Query Match 82.1%; Score 23; DB 12; Length 34; 

Best Local Similarity 100.0%; Pred. No. 7e-17; 



Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

MIIIIIIIIIMIMIIIIIII 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 8 

US-10-340-484-21 

; Sequence 21, Application US/10340484 

; Publication No. US20030171288A1 

; GENERAL INFORMATION: 

; APPLICANT: Stewart, Andrew F. 

TITLE OF INVENTION: Treatment of Bone Disorders with Skelatal Anabolic 
; TITLE OF INVENTION: Drugs 
; FILE REFERENCE: 25200-501 

; CURRENT APPLICATION NUMBER: US/lO/34 0 , 484 

; CURRENT FILING DATE: 2003-01-10 

; PRIOR APPLICATION NUMBER: 60/347,215 

; PRIOR FILING DATE: 2002-01-10 

; PRIOR APPLICATION NUMBER: 60/353,296 

; PRIOR FILING DATE: 2002-02-01 

; PRIOR APPLICATION NUMBER: 60/368,955 

; PRIOR FILING DATE: 2002-03-28 

; PRIOR APPLICATION NUMBER: 60/379,125 

; PRIOR FILING DATE: 2002-05-08 

; NUMBER OF SEQ ID NOS : 27 

; SOFTWARE: PatentlnVer, 2.1 

; SEQ ID NO 21 

LENGTH: 34 

TYPE : PRT 

ORGANISM: Felis catus 
US-10-340-484-21 

Query Match 82.1%; Score 23; DB 12; Length 34; 

Best Local Similarity 100.0%; Pred. No. 7e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIIIIIIIIIIIIIIMIMI 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 9 
US-10-097-079-2 

; Sequence 2, Application US/10097079 
; Publication No. US20020132973A1 
GENERAL INFORMATION: 

APPLICANT: Condon, Stephen M. 
; Morize, Isabelle 

TITLE OF INVENTION: PEPTIDE PARATHYROID HORMONE ANALOGS 
NUMBER OF SEQUENCES: 88 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Rhone-Poulenc Rorer Inc. 
STREET: 500 Areola Road, Mailstop 3C43 
CITY: Collegeville 
STATE: PA 



COUNTRY: USA 

ZIP: 19426 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
'j OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patent In Release #1.0, Version #1.3 0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 10/097 , 079 
; FILING DATE: 13-Mar-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/228,990 

FILING DATE: <Unknown> 

APPLICATION NUMBER: US 60/046,472 

FILING DATE: 14-MAY-1997 
ATTORNEY/AGENT INFORMATION: 

NAME: Martin Esq., Michael B. 

REGISTRATION NUMBER: 37,521 

REFERENCE/DOCKET NUMBER: A2678B-WO 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (610) 454-2793 

TELEFAX: (610) 454-3808 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 34 amino acids 

TYPE: amino acid 

STRANDEDNESS : <Unknown> 

TOPOLOGY: No. US20020132973A1 Relevant 
MOLECULE TYPE: peptide 
FRAGMENT TYPE: N-terminal 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-10-097-079-2 

Query Match 82.1%; Score 23; DB 14; Length 34; 

Best Local Similarity 100.0%; Pred. No. 7e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIMIIIIIIIIIIIIIIIIIII 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 10 
US-10-182-018-75 

; Sequence 75, Application US/10182018 
; Publication No. US20030049211A1 
; GENERAL INFORMATION: 

; APPLICANT: CHUGAI SEIYAKU KABUSHIKI KAISHA 

; TITLE OF INVENTION: THERAPEUTIC AND PREVENTIVE AGENTS FOR DENTAL DISEASES 
; FILE REFERENCE: PH-1092-PCT 
; CURRENT APPLICATION NUMBER: US/lO/182 , 018 
; CURRENT FILING DATE: 2002-07-24 
; PRIOR APPLICATION NUMBER: JP 2000-83034 
; PRIOR FILING DATE: 2000-01-25 
; NUMBER OF SEQ ID NOS : 75 
SOFTWARE: PatentlnVer. 2.0 



; SEQ ID NO 75 
LENGTH: 34 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-182-018-75 

Query Match 82,1%; Score 23; DB 15; Length 34; 

Best Local Similarity 100.0%; Fred. No. 7e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IMIIIMIIIIIIIIIIIIMI 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 11 
US-10-169-003-75 

; Sequence 75, Application US/10169003 
; Publication No. US20030124119A1 
; GENERAL INFORMATION: 

; APPLICANT: CHUGAI SEIYAKU KABUSHIKI KAISHA 

; TITLE OF INVENTION: Stabilized Antibody Composition and Pharmaceutical 
Preparation for 

TITLE OF INVENTION: Injection 
; FILE REFERENCE: PH-1093-PCT 
; CURRENT APPLICATION NUMBER: US/lO/169,003 
; CURRENT FILING DATE: 2 002-06-26 
; PRIOR APPLICATION NUMBER: JP 11-375203 
; PRIOR FILING DATE: 1999-12-28 
; NUMBER OF SEQ ID NOS : 75 
; SOFTWARE: PatentIn Ver. 2.0 
; SEQ ID NO 75 

LENGTH: 34 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-169-003-75 

Query Match 82.1%; Score 23; DB 15; Length 34; 

Best Local Similarity 100.0%; Pred. No. 7e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIMIIIIIIIIIIMIMM 

Db 12 GKSIQDLRRRFFLHHLIAEIHTA 34 



RESULT 12 
US-09-843-221A-66 

; Sequence 66, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 



FILE REFERENCE: A-665B 
; CURRENT APPLICATION NUMBER: US/09/843 , 221A 
; CURRENT FILING DATE: 2001-04-26 
; PRIOR APPLICATION NUMBER: 60/266,673 
; PRIOR FILING DATE: 2001-02-06 
; PRIOR APPLICATION NUMBER: 60/214,860 

PRIOR FILING DATE: 2000-06-28 
; PRIOR APPLICATION NUMBER: 60/200,053 
; PRIOR FILING DATE: 2000-04-27 
; NUMBER OF SEQ ID NOS : 170 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 66 
LENGTH: 35 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-66 

Query Match 82.1%; Score 23; DB 11; Length 35; 

Best Local Similarity 100.0%; Pred. No. 7.2e-17; 

Matches 23; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 6 GKSIQDLRRRFFLHHLIAEIHTA 28 

IIIIIIIIIIIIIIIIIIIIIM 
Db 13 GKSIQDLRRRFFLHHLIAEIHTA 35 



RESULT 13 
US-09-843-221A-64 

; Sequence 64, Application US/09843221A 

; Publication No. US20030039654A1 

; GENERAL INFORMATION: 

; APPLICANT: KOSTENUIK, PAUL 

; APPLICANT: LIU, CHUAN-FA 

; APPLICANT: LACEY, DAVID LEE 

; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

; TITLE OF INVENTION: RELATED PROTEIN 
; FILE REFERENCE: A-665B 

; CURRENT APPLICATION NUMBER: US/09/843 , 221A 

; CURRENT FILING DATE: 2001-04-26 

; PRIOR APPLICATION NUMBER: 60/266,673 

; PRIOR FILING DATE: 2001-02-06 

; PRIOR APPLICATION NUMBER: 60/214,860 

; PRIOR FILING DATE: 2000-06-28 

; PRIOR APPLICATION NUMBER: 60/200,053 

; PRIOR FILING DATE: 2000-04-27 

; NUMBER OF SEQ ID NOS: 170 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 64 

LENGTH: 36 

TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: modified human PTHrP 
US-09-843-221A-64 



CC sperm DNA into a highly condensed, stable and inactive 

CC complex. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 
CC TISSUE SPECIFICITY: Testis. 

CC -!- MASS SPECTROMETRY: MW=3941; METHOD=Electrospray . 

DR GO; GO: 0005718; C :nucleosome; NAS . 

DR GO; GO: 0005634; C: nucleus; NAS. 

DR GO; 00:0003677; F : DNA binding activity; NAS. 

DR GO; 00:0007001; Prchromosome organization and biogenesis (sen. . NAS. 

DR GO; 00:0007076; P:mitotic chromosome condensation; NAS. 

DR GO; GO:0006334; P:nucleosome assembly; NAS. 

DR GO; 00:0007283; P : spermatogenesis ; NAS. 

KW Chromosomal protein; Nucleosome core; Spermatogenesis; 

KW DNA-binding; DNA condensation; Nuclear protein. 

FT DOMAIN 2 15 POLY-ARG. 

FT DOMAIN 17 26 POLY-ARG. 

SQ SEQUENCE 30 AA; 3943 MW; 14F1BC7E4D277049 CRC64; 



Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 13 RRR 15 

111 

Db 2 RRR 4 



RESULT 21 
MMAL^DERMI 

ID MMAL_DERMI STANDARD; PRT; 3 0 AA. 

AC P16312; 

DT Ol-AUG-1990 (Rel . 15, Created) 

DT Ol-AUG-1990 {Rel, 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Major mite fecal allergen Der M 1 (EC 3.4.22.-) (Der M I) (Fragment). 

ON DERMl . 

OS Derma tophago ides microceras (House-dust mite) . 

OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; 

OC Acari formes; Sarcoptif ormes ; Astigmata; Psoroptidia; Analgoidea; 

OC Pyroglyphidae; Derma tophagoides . 

OX NCBI_TaxID=6955; 

RN [1] 

RP SEQUENCE . 

RX MEDLINE=88229138 ; PubMed=3372999 ; 

RA Lind P., Hansen O.C., Horn N.; 

RT "The binding of mouse hybridoma and human IgE antibodies to the major 

RT fecal allergen, Der p I , of Dermatophagoides pteronyssinus . Relative 

RT binding site location and species specificity studied by solid-phase 

RT inhibition assays with radiolabeled antigen."; 

RL J. Immunol. 140:4256-4262(1988). 

CC FUNCTION: THIS PROTEIN IS THE MAJOR ALLERGEN OF HOUSE DUST MITE, 

CC IT IS A THIOL PROTEASE THAT HYDROLYZES PROTEINS, WITH A PREFERENCE 

CC FOR PHE OR BASIC RESIDUES. 

CC -I- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY CI. 

DR PIR; B27634; B27634 . 

DR MEROPS; CO 1.073; -. 



DR InterPro; IPR000169; SHprot_acsite . 

DR PROSITE; PS 0013 9; THIOL_PROTEASE_CYS ; PARTIAL, 

DR PROSITE; PS00639; THIOL_PROTEASE_HIS ; PARTIAL. 

DR PROSITE; PS00640; THIOL_PROTEASE_ASN; PARTIAL. 

KW Hydrolase; Thiol protease; Allergen. 

FT NON_TER 30 30 

SQ SEQUENCE 30 AA; 3314 MW; D60B742967194886 CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Fred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 11 DLR 13 

III 

Db 16 DLR 18 

RESULT 22 
NU02_S0LTU 

ID NU02_S0LTU STANDARD; PRT; 30 AA. 

AC P80268; 

DT Ol-FEB-1994 (Rel . 28, Created) 

DT Ol-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE NADH-ubiquinone oxidoreductase 18 kDa subunit (EC 1.6.5.3} 

DE (EC 1,6.99-3) (Complex I-18KD) (CI-18KD) (Fragment). 

OS Solanum tuberosum (Potato) . 

DC Eukaryota; Viridiplantae; Streptophyta ; Embryophyta; Tracheophyta ; 

DC Spermatophyta ; Magnoliophyta; eudicotyledons ; core eudicots; 

OC Asteridae; lamiids; Solanales; Solanaceae; Solanum. 

OX NCBI_TaxID=4113 ; 

RN [1] 

RP SEQUENCE . 

RC TISSUE=Tuber; 

RX MEDLINE=97077345; PubMed=8 919912 ; 

RA Jansch L. , Kruft V., Schmitz U.K., Braun H.P.; 

RT "New insights into the composition, molecular mass and stoichiometry 

RT of the protein complexes of plant mitochondria."; 

RL Plant J. 9:357-368(1996). 
RN [2] 

RP SEQUENCE OF 1-22. 

RC STRAIN=cv. Bintje; TISSUE=Tuber ; 

RX MEDLINE=94124587; PubMed=82 944 84 ; 

RA Herz U., Schroeder W., Liddell A., Leaver C.J., Brennicke A., 

RA Grohmann L . ; 

RT "Purification of the NADH: ubiquinone oxidoreductase (complex I) of 

RT the respiratory chain from the inner mitochondrial membrane of 

RT Solanum tuberosum."; 

RL J. Biol- Chem. 269:2263-2269(1994). 

CC -!- FUNCTION: TRANSFER OF ELECTRONS FROM NADH TO THE RESPIRATORY 

CC CHAIN. THE IMMEDIATE ELECTRON ACCEPTOR FOR THE ENZYME IS BELIEVED 

CC TO BE UBIQUINONE. 

CC -!- CATALYTIC ACTIVITY: NADH + ubiquinone = NAD(+) + ubiquinol . 

CC -I- CATALYTIC ACTIVITY: NADH + acceptor = NAD(+) + reduced acceptor. 

CC -!- SUBUNIT: COMPLEX I IS COMPOSED OF ABOUT 30 DIFFERENT SUBUNITS . 

CC -!- SUBCELLULAR LOCATION: MATRIX SIDE OF THE MITOCHONDRIAL INNER 
CC MEMBRANE . 



KW Oxidoreductase; NAD; Ubiquinone; Mitochondrion. 
FT NON_TER 3 0 30 

SQ SEQUENCE 30 AA; 3263 MW; 2 1AA88B3FAEDFE1D CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100,0%; Pred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 

Db 



12 LRR 14 
III 

24 LRR 26 



RESULT 23 
PRT1_CLUPA 

ID PRT1_CLUPA STANDARD; PRT; 3 0 AA. 

AC P02335; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-DEC-1998 (Rel. 37, Last annotation update) 

DE Protamine YII (Clupeine YII) . 

OS Clupea pallasii (Pacific herring) , and 

OS Clupea harengus (Atlantic herring) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Clupeomorpha ; Clupeidae; 

DC Clupea . 

OX NCBI_TaxID=3 0724, 7950; 

RN [1] 

RP SEQUENCE. 

RC spec: ES=C- pallasii; 

RX MEDLINE=73223106; PubMed=466474 0 ; 

RA Suzuki K. , Ando T. ; 

RT "Studies on protamines. XVI. The complete amino acid sequence of 

RT clupeine YII . " ; 

RL J. Biochem. 72:1419-1432(1972). 

RN [2] 

RP SEQUENCE . 

RC SPECIES^C. harengus; 

RA Chang W. J. , Nukushina M., Ishii S., Nakahara C. , Ando T. ; 

RL Submitted (AUG-1970) to the PIR data bank. 

CC -!- FUNCTION: PROTAMINES SUBSTITUTE FOR HISTONES IN THE CHROMATIN OF 
CC SPERM DURING THE HAPLOID PHASE OF SPERMATOGENESIS- THEY COMPACT 

CC SPERM DNA INTO A HIGHLY CONDENSED, STABLE AND INACTIVE COMPLEX. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 
CC TISSUE SPECIFICITY: Testis. 

DR PIR; A37575; CLHR2A. 

DR PIR; A38052; CLHRY2 . 

KW Chromosomal protein; Nucleosome core; Spermatogenesis; DNA-binding; 

KW Testis; DNA condensation; Nuclear protein. 

SQ SEQUENCE 30 AA; 4049 MW; 7F9BBB80F3ADA566 CRC64; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 



13 RRR 15 
III 



Db 2 RRR 4 

RESULT 24 
PRT2_0NCMY 

ID PRT2_0NCMY STANDARD; PRT; 3 0 AA. 

AC P02331; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Protamine CI I I (Protamine PPC 6B) . 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii ; Teleostei; Euteleostei ; 

OC Protacanthopterygii; Sa Imoni formes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8022 ; 

RN [1] 

RP SEQUENCE FROM N.A, (CLONE CIII) . 

RX MEDLINE=8119B983; PubMed=626273 0 ; 

RA Gedamu L. , Wosnick M.A. , Connor W. , Watson D.C., Dixon G.H. , 

RA latrou K. ; 

RT "Molecular analysis of the protamine multi-gene family in rainbow 

RT trout testis . " ; 

RL Nucleic Acids Res. 9:1463-1482(1981). 
RN [2] 

RP SEQUENCE FROM N.A. (CLONE PPC 6B) , 

RX MEDLINE=82030654; PubMed=7287661 ; 

RA Sakai M. , Fuj ii-Kuriyama Y., Saito T., Muramatsu M.; 

RT "Closely related mRNA sequences of protamines in rainbow trout 

RT testis . " ; 

RL J. Biochem. 89:1863-1868(1981). 

CC -I- FUNCTION: PROTAMINES SUBSTITUTE FOR HISTONES IN THE CHROMATIN OF 
CC SPERM DURING THE HAPLOID PHASE OF SPERMATOGENESIS- THEY COMPACT 

CC SPERM DNA INTO A HIGHLY CONDENSED, STABLE AND INACTIVE COMPLEX. 

CC SUBCELLULAR LOCATION: Nuclear. 

CC -!- TISSUE SPECIFICITY: Testis. 

CC MISCELLANEOUS: THE PROTAMINE CIIl/PPC 6B SEQUENCE IS SHOWN. 

CC - 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http : //www. isb-sib. ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; K03 052; -; NOT_ANNOTATED_CDS . 

DR PIR; A02673; IRTRC3 . 

KW Chromosomal protein; Nucleosome core; Spermatogenesis; DNA-binding; 

KW Testis; DNA condensation; Nuclear protein. 



FT 


INIT_MET 


0 


0 




FT 


VARIANT 


10 


10 


V -> I (IN 18% OF THE CIII) 


FT 


VARIANT 


10^ 


10 


V -> P (IN 18% OF THE CIII) 


SQ 


SEQUENCE 


30 AA; 


3991 MW; 


0EBCBF9DC788EFA7 CRC64; 



Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5.5e+03; 



Matches 



3; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 13 RRR 15 

Ml 

Db 2 RRR 4 

RESULT 25 
PRT3_0NCMY 

ID PRT3_0NCMY STANDARD; PRT; 3 0 AA. 

AC P02332; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT Ol-APR-1993 (Rel. 25, Last annotation update) 

DE Protamine IB. 

OS Oncorhynchus mykiss {Rainbow trout) (Salmo gairdneri) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii; Salmon! formes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8022; 

RN [1] 

RP SEQUENCE FROM N.A. (CLONES PRTP178 AND PRTP94) . 

RX MEDLINE=81198983; PubMed-626273 0 ; 

RA Gedamu L., Wosnick M.A. , Connor W. , Watson D.C., Dixon G.H., 

RA latrou K. ; 

RT "Molecular analysis of the protamine multi-gene family in rainbow 

RT trout testis . " ; 

RL Nucleic Acids Res. 9:1463-1482(1981). 

RN [2] 

RP SEQUENCE . 

RX MEDLINE=86274711; PubMed=37553 98 ; 

RA McKay D.J., Renaux B.S., Dixon G.H.; 

RT "Rainbow trout protamines. Amino acid sequences of six distinct 

RT proteins from a single testis. ",- 

RL Eur. J. Biochem. 158:361-366(1986). 

CC -I- FUNCTION: PROTAMINES SUBSTITUTE FOR HISTONES IN THE CHROMATIN OF 
CC SPERM DURING THE HAPLOID PHASE OF SPERMATOGENESIS. THEY COMPACT 

CC SPERM DNA INTO A HIGHLY CONDENSED, STABLE AND INACTIVE COMPLEX. 

CC SUBCELLULAR LOCATION: Nuclear. 

CC -!- TISSUE SPECIFICITY: Testis. 

CC MISCELLANEOUS: BOTH OF THESE SEQUENCES DIFFER FROM ONE OF THE 

CC PROTAMINE CI I I MINOR COMPONENTS IN HAVING 9-ARG INSTEAD OF 9 -PRO. 

DR PIR; A93723; IRTR78 . 

KW Chromosomal protein; Nucleosome core; Spermatogenesis; DNA-binding; 

KW Testis; DNA condensation; Nuclear protein. 

FT INIT_MET 0 0 

SQ SEQUENCE 30 AA; 4064 MW; 0EBCBF9DD1E78 947 CRC64; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5-5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 13 RRR 15 

III 

Db 2 RRR 4 



RESULT 26 
PRT4_0NCMY 

ID PRT4_0NCiyiY STANDARD; PRT; 30 AA. 

AC P02333; 

DT 21~JUL-1986 (Rel . 01, Created) 
DT 21-JUL-1986 (Rel- 01, Last sequence update) 
DT 15-DEC-1998 (Rel. 37, Last annotation update) 
DE Protamine PTP4 . 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

DC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii; Salmonif ortnes ; Salmonidae; Oncorhynchus. 

OX NCB I_TaxI D= 8 0 2 2 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=79199790; PubMed-4 50133 / 

RA Jenkins J.R. ; 

RT "Sequence divergence of rainbow trout protamine mRNAs; comparison of 
RT coding and non-coding nucleotide sequences in three protamine cDNA 
RT plasmids . " ; 
RL Nature 279:809-811(1979). 

CC -!- FUNCTION: PROTAMINES SUBSTITUTE FOR HISTONES IN THE CHROMATIN OF 
CC SPERM DURING THE HAPLOID PHASE OF SPERMATOGENESIS. THEY COMPACT 

CC SPERM DNA INTO A HIGHLY CONDENSED, STABLE AND INACTIVE COMPLEX. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 
CC -!- TISSUE SPECIFICITY: Testis. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinformatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; X01204; C:AA25623.1; 
DR PIR; A02675; IRTR4 . 

KW Chromosomal protein; Nucleosome core; Spermatogenesis; DNA-binding; 
KW Testis; DNA condensation; Nuclear protein. 
FT INIT_MET 0 0 

SQ SEQUENCE 3 0 AA; 4064 MW; 0EBCBF9DCD979537 CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 13 RRR 15 

Ml 

Db 2 RRR 4 

RESULT 27 
PRTB_ONCMY 

ID PRTB_ONCMY STANDARD; PRT; 3 0 AA. 

AC P12B19; 

DT Ol-OCT-1989 (Rel. 12, Created) 

DT Ol-OCT-1989 (Rel. 12, Last sequence update) 



DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Protamine lA (Protamine PRTP43/PPC 2E) . 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii; Salmonif ormes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8022; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=86274711; PubMed=3755398 ; 

RA McKay D.J., Renaux B.S., Dixon G.H.; 

RT "Rainbow trout protamines. Amino acid sequences of six distinct 

RT proteins from a single testis."; 

RL Eur. J. Biochem. 158:361-366(1986). 

RN [2] 

RP SEQUENCE FROM N.A. (CLONE PRTP43) . 

RX MEDLINE=81198983; PubMed=6262730 ; 

RA Gedamu L. , Wosnick M.A., Connor W., Watson D.C., Dixon G.H., 

RA latrou K. ; 

RT "Molecular analysis of the protamine multi-gene family in rainbow 

RT trout testis . " ; 

RL Nucleic Acids Res. 9:1463-1482(1981). 
RN [3] 

RP SEQUENCE FROM N.A. (CLONE PPC 2E) , 

RX MEDLINE=82030654; PubMed=72 87661 ; 

RA Sakai M., Fuj ii-Kuriyama Y., Saito T., Muramatsu M.; 

RT "Closely related mRNA sequences of protamines in rainbow trout 

RT testis."; 

RL J. Biochem. 89:1863-1868(1981), 

CC -I- FUNCTION: PROTAMINES SUBSTITUTE FOR HISTONES IN THE CHROMATIN OF 
CC 'SPERM DURING THE HAPLOID PHASE OF SPERMATOGENESIS. THEY COMPACT 

CC SPERM DNA INTO A HIGHLY CONDENSED, STABLE AND INACTIVE COMPLEX. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 

CC -!- TISSUE SPECIFICITY: Testis. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; K03052; AAA49607.1; 

DR EMBL; K03051; AAA49606.1; -. 

DR PIR; B02673; IRTRC2 . 

DR PIR; 151349; 151349. 

KW Chromosomal protein; Nucleosome core; Spermatogenesis; DNA-binding; 

KW Testis; DNA condensation; Nuclear protein. 
FT INIT_MET 0 0 

SQ SEQUENCE 30 AA; 4050 MW; 0EBCBF9DC78B5947 CRC64; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



13 RRR 15 



Db 



2 RRR 4 



RESULT 28 
PSAM_ODOSI 

ID PSAM_ODOSI STANDARD; PRT; 30 AA. 

AC P49487; 

DT Ol-FEB-1996 (Rel . 33, Created) 

DT Ol-FEB-1996 (Rel. 33, Last sequence update) 

DT Ol-OCT-1996 (Rel. 34, Last annotation update) 

DE Photosystem I reaction centre subunit XII (PSI-M) . 

GN PSAM . 

OS Odontella sinensis (Marine centric diatom) . 

OG Chloroplast . 

OC Eukaryota; stramenopiles ; Bacillariophyta; Coscinodiscophyceae; 

OC Biddulphiophycidae; Eupodiscales ; Eupodiscaceae; Odontella, 

OX NCBI_TaxID-2839; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Kowallik K.V. , Stoebe B., Schaffran I., Kroth-Pancic P., Freier U. ; 

RT "The chloroplast genome of a chlorophyll a+c-containing alga, 

RT Odontella sinensis."; 

RL Plant Mol . Biol. Rep. 13:336-342(1995). 

CC -!- SIMILARITY: BELONGS TO THE PSAM FAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; Z67753; CAA91676.1; 

DR PIR; S78303; S78303 . 

KW Photosystem I; Photosynthesis; Chloroplast. 

SQ SEQUENCE 30 AA; 3329 MW; 73FDEB91E4BF634F CRC64; 



Query Match 10,7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5-5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



21 LIA 23 
III 

12 LIA 14 



RESULT 29 
PSAM_PINTH 

ID PSAM_PINTH STANDARD; PRT; 3 0 AA, 

AC P41601; 

DT Ol-NOV-1995 (Rel. 32, Created) 

DT Ol-NOV-1995 (Rel. 32, Last sequence update) 

DT Ol-OCT-1996 (Rel. 34, Last annotation update) 

DE Photosystem I reaction centre subunit XII (PSI-M) , 

GN PSAM . 



OS Pinus thunbergii (Green pine) {Japanese black pine) . 

OG Chloroplast . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Conif eropsida ; Coniferales; Pinaceae; Pinus . 

OX NCBI_TaxID=33 50; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95024047; PubMed=79378 93 ; 

RA Wakasugi T. , Tsudzuki J., Ito S., Nakashima K. , Tsudzuki T. , 

RA Sugiura M. ; 

RT "Loss of all ndh genes as determined by sequencing the entire 

RT chloroplast genome of the black pine Pinus thunbergii."; 

RL Proc- Natl. Acad. Sci. U.S.A. 91:9794-9798(1994). 
CC SIMILARITY: BELONGS TO THE PSAM FAMILY. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; D17510; BAA04383.1; 

DR EMBL; D17510; BAA04316.1; -. 

DR PIR; T07505; T07505. 

KW Photosystem I; Photosynthesis; Chloroplast. 

SQ SEQUENCE 30 AA; 3321 MW; A15044 1AB5D5AF7C CRC64; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 
Db 



21 



LIA 23 

III 

7 LIA 9 



RESULT 30 
PSAM_PORPU 

ID PSAM_PORPU STANDARD; PRT; 30 AA. 

AC P51395; 

DT Ol-OCT-1996 (Rel . 34, Created) 

DT Ol-OCT-1996 (Rel. 34, Last sequence update) 

DT Ol-OCT-1996 (Rel. 34, Last annotation update) 

DE Photosystem I reaction centre subunit XII (PSI-M) . 

GN PSAM . 

OS Porphyra purpurea. 
OG Chloroplast. 

OC Eukaryota; Rhodophyta; Bangiophyceae; Bangiales; Bangiaceae; Porphyra. 
OX NCBI_TaxID=2787 ; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Avonport ; 

RA Reith M.E., Munholland J.; 

RT "Complete nucleotide sequence of the Porphyra purpurea chloroplast 
RT genome . " ; 



RL Plant Mol. Biol. Rep. 13:333-335(1995). 

CC SIMILARITY: BELONGS TO THE PSAM FAMILY. 

QC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U38804; AAC08281-1; 

DR PIR; S73316; S73316. 

KW Photosystem I; Photosynthesis; Chloroplast. 

SQ SEQUENCE 30 AA; 3338 MW; 8D193 047 9D8A5527 CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Fred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 5 LGK 7 

III 

Db 24 LGK 26 



RESULT 31 
RL18_HALCU 

ID RL18_HALCU STANDARD; PRT; 3 0 AA. 

AC P05970; 

DT Ol-NOV-1988 (Rel . 09, Created) 

DT Ol-NOV-1988 (Rel. 09, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE SOS ribosomal protein L18P {HCUL18) (HL13) (Fragment) . 

GN RPL18P. 

OS Halobacterium cutirubrum. 

OC Archaea; Euryarchaeota; Halobacteria; Halobacteriales ; 

OC Halobacter iaceae ; Halobacterium . 

OX NCBI_TaxID=2242 ; 

RN [1] 

RP SEQUENCE . 

RX MEDLINE=79045279; PubMed=152199 ; 

RA Smith N., Matheson A.T., Yaguchi M., Willick G, , Nazar R.N.; 

RT "The 5-S RNA-protein complex from an extreme halophile, 

RT Halobacterium cutirubrum. Purification and characterization."; 

RL Eur. J. Biochem, 89:501-509(1978). 

CC -!- SIMILARITY: BELONGS TO THE L18P FAMILY OF RIBOSOMAL PROTEINS. 

DR PIR; S07217; S07217. 

KW Ribosomal protein. 

FT NON_TER 30 30 

SQ SEQUENCE 30 AA; 3624 MW; 3A50079B1569CB74 CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 30; 

Best Local Similarity 100.0%; Pred. No. 5.5e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 



13 RRR 15 
III 



Db 11 RRR 13 

RESULT 32 
C0X4_NEUCR 

ID C0X4_NEUCR STANDARD; PRT; 31 AA. 

AC P06809; 

DT Ol-JAN-1988 (Rel , 06, Created) 

DT Ol-JAN-1988 (Rel. 06, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Cytochrome c oxidase polypeptide IV, mitochondrial precursor 

DE (EC 1.9.3.1) (Fragment). 

GN COX-4 . 

OS Neurospora crassa. 

OC Eukaryota; Fungi; Ascomycota; Pezizomycot ina ; Sordariomycetes ; 

OC Sordariomycetidae; Sordariales; Sordariaceae; Neurospora. 

OX NCBI_TaxID=5141; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=86085927; PubMed=3001085 ; 

RA Sachs M.S., David M. , Werner S., Rajbhandary U.L,; 

RT "Nuclear genes for cytochrome c oxidase subunits of Neurospora 

RT crassa. Isolation and characterization of cDNA clones for subunits 

RT IV, V, VI, and possibly VII ." ; 

RL J- Biol. Chem. 261:869-873(1986). 

CC -!- c:atalytIC activity: 4 ferrocytochrome c + 0(2) =4 f erricytochrome 
CC c + 2 H(2)0. 

CC -!- SUBCELLULAR LOCATION: Mitochondrial inner membrane. 

CC -!- SIMILARITY: BELONGS TO THE CYTOCHROME C OXIDASE VB FAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL out station - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; M12116; AAA33574.1; -. 
DR PIR; A25629; A25629. 

KW Oxidoreductase; Mitochondrion; Inner membrane; Transit peptide. 
FT NON_TER 1 1 

FT TRANSIT <1 16 MITOCHONDRION. 

FT CHAIN 17 >31 CYTOCHROME C OXIDASE POLYPEPTIDE IV. 

FT NON_TER 31 31 

SQ SEQUENCE 31 AA; 3397 MW; 069F5D55103 00362 CRC64; 

Query Match 10.7%; Score 3; DB 1; Length 31; 

Best Local Similarity 100.0%; Pred. No. 5.6e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 12 LRR 14 

III 

Db 5 LRR 7 



RESULT 33 



DEJP_DROiyiE 

ID DEJP_DROME STANDARD; PRT; 31 AA. 

AC P81160; 

DT 15-JUL-1998 (Rel . 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Ductus ejaculatorius peptide 99B. 

GN DUP99B. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI_TaxID-7227; 

RN [1] 

RP SEQUENCE OF 1-24 FROM N.A., AND SEQUENCE OF 9-31. 

RC STRAIN=Oregon-R; TISSUE=Ductus ejaculatorius; 

RX MEDLINE=21835775; PubMed=11846801 ; 

RA Saudan P., Hauck K. , Soller M., Choffat Y. , Ottiger M., Sporri M., 

RA Ding Z., Hess D. , Gehrig P.M., Klauser S., Hunziker P., Kubli E. ; 

RT "Ductus ejaculatorius peptide 99B (DUP99B) , a novel Drosophila 

RT melanogaster sex-peptide pheromone . " ; 

RL Eur. J. Biochem, 269:98 9-997(2002). 

CC -!- FUNCTION: INDUCES POST-MATING RESPONSES. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: DUCTUS EJACULATORIUS. 

CC -!- SIMILARITY: TO PARAGONIAL PEPTIDE B. 

DR FlyBase; FBgn0024381; Dup99B. 

DR GO; GO: 0045434; P:negative regulation of female receptivity, . . . 

DR GO; GO: 0046662; P: regulation of oviposition; NAS . 

KW Behavior; Glycoprotein; Pyrrolidone carboxylic acid. 

FT MOD_RES 1 1 PYRROLIDONE CARBOXYLIC ACID. 

FT DISULFID 19 31 

FT CARBOHYD 4 4 

SQ SEQUENCE 31 AA; 3766 MW; 



IMP. 



N-LINKED (GLCNAC. . .). 
B90A9B99C120EF49 CRC64; 



Query Match 10,7%; 
Best Local Similarity 100.0% 
Matches 3; Conservative 



Score 3; DB 1; Length 31; 
Pred. NO. 5.6e+03; 
0; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 

Db 



4 NLG 6 
III 

22 NLG 24 



RESULT 34 
GP37_BPSP1 

ID GP3 7_BPSP1 STANDARD; PRT; 31 AA. 

AC 048393; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 15-DEC-1998 (Rel. 37, Last annotation update) 

DE Putative gene 37 protein. 

GN 37. 

OS Bacteriophage SPOl. 

OC Viruses; dsDNA viruses, no RNA stage; Caudovirales ; Myoviridae; 

OC SPOl -like viruses. 

OX NCBI TaxID=10685; 



[1] 

SEQUENCE FROM N.A. 

MEDLINE=98327781; PubMed=9657951 ; 

Stewart C.R., Gaslightwala I., Hinata K. , Krolikowski K.A. , 
Needleman D.S., Peng A.S.-Y., Peterman M.A. , Tobias A., Wei P.; 
"Genes and regulatory sites of the 'host- takeover module' in the 
terminal redundancy of Bacillus subtilis bacteriophage SPOl."; 
Virology 246:329-340(1998) , 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib.ch) . 



RN 
RP 
RX 
RA 
RA 
RT 
RT 
RL 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 

CC 

DR EMBL; AF031901; AAC29006.1 

KW Hypothetical protein. 

SQ SEQUENCE 31 AA; 3715 MW 



10-7% 



5ECBA628AF320670 CRC64 ; 



Score 3; DB 1; Length 31; 



Query Match 

Best Local Similarity 100.0%; Pred. No. 5.6e+03; 
Matches 3; Conservative 0; Mismatches 0; 



Indels 



0 ; Gaps 



0; 



Qy 

Db 



13 RRR 15 
III 

29 RRR 31 



RESULT 35 
NAP4_HUMAN 

ID NAP4_HUMAN STANDARD; PRT; 31 AA. 

AC P19877; 

DT Ol-FEB-1991 (Rel . 17, Created) 

DT Ol-FEB-1991 (Rel. 17, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Neutrophil -activating protein 4 (NAP-4) (Fragment) . 

OS Homo sapiens (Human) , 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE . 

RC TISSUE=Platelet ; 

RX MEDLINE=91054515; PubMed=2241978 ; 

RA Schroeder J.-M., Sticherling M. , Persoon N.L.M., Christophers E.; 

RT "Identification of a novel platelet-derived neutrophil -chemotactic 

RT polypeptide with structural homology to platelet-factor 4."; 

RL Biochem. Biophys. Res. Commun. 172:898-904(1990), 

CC -!- FUNCTION: INDUCES CHEMOTAXIS OF NEUTROPHILS. 

CC -!- SIMILARITY: BELONGS TO THE INTERCRINE FAMILY (SMALL CYTOKINE 
CC C-C) (CHEMOKINE CC) . BUT LACK THE C-X-C OR C-C MOTIFS. 

DR PIR; A36162; A36162, 

DR HSSP; P02776; IRHP . 

DR GO; GO: 0008009; F : chemokine activity; TAS, 

DR GO; GO: 0006935; P : chemotaxis ; TAS, 



DR InterPro; IPR000827; CC_cheTnkine_sml . 

DR PROSITE; PS00472; SMALL_CYTOKINES_CC; PARTIAL. 

KW Chemotaxis; Growth factor. 

FT NON_TER 31 31 

SQ SEQUENCE 31 AA; 3447 MW; AD910FB8 94EC1760 CRC64 ; 

Query Match 10.7%; Score 3; DB 1; Length 31; 

Best Local Similarity 100.0%; Pred. No. 5.6e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 10 QDL 12 

Ml 

Db 6 QDL 8 

RESULT 36 
PETL_ODOSI 

ID PETL_ODOSI STANDARD; PRT; 31 AA. 

AC P4 9524; 

DT Ol-FEB-1996 (Rel . 33, Created) 

DT Ol-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Cytochrome b6-f complex subunit VI (Cytochrome b6f complex subunit 

DE petL) . 

GN PETL . 

OS Odontella sinensis (Marine centric diatom) . 

OG Chloroplast- 

OC Eukaryota; stramenopiles ; Bacillariophyta ; Coscinodiscophyceae; 

OC Biddulphiophycidae; Eupodiscales ; Eupodiscaceae ; Odontella. 

OX NCBI_TaxID=2839 ; 
RN [1] 

RP SEQUENCE FROM N.A. 

RA Kowallik K.V., Stoebe B., Schaffran I., Kroth-Pancic P., Freier U. ; 
RT "The chloroplast genome of a chlorophyll a+c-containing alga, 

RT Odontella sinensis."; 

RL Plant Mol. Biol. Rep. 13:336-342(1995). 

CC -!- FUNCTION: THE CYTOCHROME B6-F COMPLEX FUNCTIONS IN THE LINEAR 
CC CROSS -MEMBRANE TRANSPORT OF ELECTRONS BETWEEN PHOTOSYSTEM II AND 

CC I, AS WELL AS IN CYCLIC ELECTRON FLOW AROUND PHOTOSYSTEM I. PETL 

CC IS IMPORTANT FOR PHOTOAUTOTROPHIC GROWTH AS WELL AS FOR ELECTRON 

CC TRANSFER EFFICIENCY AND STABILITY OF THE CYTOCHROME B6-F COMPLEX. 

CC -!- SUBCELLULAR LOCATION: Thylakoid membrane-associated. 
CC -!- SIMILARITY: Belongs to the petL family. 

CC 7" 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 
CC the European Bioinformatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; Z67753 ; CAA91708.1; -. 
DR PIR; S78335; S78335. 
DR HAMAP; MF_00433; -; 1. 
DR Pfam; PF05115; PetL; 1. 

KW Electron transport; Chloroplast; Respiratory chain; Transmembrane; 



KW Thylakoid. 

FT TRANSMEM 4 24 

SQ SEQUENCE 31 AA; 3459 MW; 



POTENTIAL. 

AD65232A8B27853E CRC64 ; 



Query Match 10.7%; Score 3; DB 1; Length 31; 

Best Local Similarity 100.0%; Pred. No. 5.6e+03; 

Matches 3; Conservative. 0; Mismatches 0; Indels 0; Gaps 

Qy 7 KSI 9 

Ml 

Db 26 KSI 28 

RESULT 37 
PRT2_CLUPA 

ID PRT2_CLUPA STANDARD; PRT; 31 AA. 

AC P02336; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-DEC-1998 (Rel. 37, Last annotation update) 

DE Protamine Z (Clupeine Z) . 

OS Clupea pallasii {Pacific herring) , and 

OS Clupea harengus (Atlantic herring) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Clupeomorpha ; Clupeidae; 

OC Clupea . 

OX NCBI_TaxID=30724, 7950; 

RN [1] 

RP SEQUENCE . 

RC SPECIES=C. pallasii; 

RX MEDLINE=71157437; PubMed-555 1645 ; 

RA Iwai K., Nakahara C. , Ando T. ; 

RT "Studies on protamines. XV. The complete amino acid sequence of the Z 

RT component of clupeine. Application of N leads to O acyl rearrangement 

RT and selective hydrolysis in sequence determination."; 

RL J. Biochem. 69:493-509(1971). 

RN [2] 

RP SEQUENCE . 

RC SPECIES=C. harengus; 

RA Chang W.J., Nukushina M, , Ishii S., Nakahara C, Ando T. ; 

RL Submitted (AUG-1970) to the PIR data bank. 

RN [3] 

RP X-RAY CRYSTALLOGRAPHY (2.0 ANGSTROMS). 

RX MEDLINE=92126280; PubMed-1772633 ; 

RA Balschmidt P., Hansen F.B., Dodson E. , Dodson G. , Korber F. ; 

RT "Structure of porcine insulin cocrystallized with clupeine Z."; 

RL Acta Crystallogr. B 47:975-986(1991). 

CC -I- FUNCTION: PROTAMINES SUBSTITUTE FOR HISTONES IN THE CHROMATIN OF 
CC SPERM DURING THE HAPLOID PHASE OF SPERMATOGENESIS. THEY COMPACT 

CC SPERM DNA INTO A HIGHLY CONDENSED, STABLE AND INACTIVE COMPLEX. 

CC -I- SUBCELLULAR LOCATION: Nuclear. 
CC TISSUE SPECIFICITY: Testis, 

CC -!- MISCELLANEOUS: CLUPEINE Z IS PROBABLY THE RESULT OF A CROSSOVER 
CC BETWEEN THE GENES FOR CLUPEINES YI AND YII. 

DR PIR; A37576; CLHRZA. 

DR PIR; A38053; CLHRZ . 

DR PDB; 7 INS; 31- JAN- 94. 



KW Chromosomal protein; Nucleosome core; Spermatogenesis; DNA-binding; 
KW Testis; DNA condensation; Nuclear protein; 3D-structure . 
SQ SEQUENCE 31 AA; 4165 MW; 092CCBF7F3AFC050 CRC64; 

Query Match 10.7%; Score 3; DB 1; Length 31; 

Best Local Similarity 100,0%; Pred. No. 5.6e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 
Db 



13 RRR 15 

III 
2 RRR 4 



RESULT 38 
PRT3_CLUPA 

ID PRT3_CLUPA STANDARD; PRT; 31 AA. 

AC P02337; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-DEC-1998 (Rel. 37, Last annotation update) 

DE Protamine YI (Clupeine YI) . 

OS Clupea pallasii (Pacific herring) , and 

OS Clupea harengus (Atlantic herring) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Clupeomorpha ; Clupeidae; 

OC Clupea . 

OX NCBI_TaxID=30724, 7950; 

RN [1] 

RP SEQUENCE. 

RC SPECIES=C. pallasii; 

RX MEDLINE=73223107; PubMed=4 664741 ; 

RA Suzuki K. , Ando T. ; 

RT "Studies on protamines. XVII. The complete amino acid sequence of 

RT clupeine YI . " ; 

RL J, Biochem. 72:1433-1446(1972). 

RN [2] 

RP SEQUENCE . 

RC SPECI ES=C . harengus ; 

RA Chang W.J., Nukushina M. , Ishii S., Nakahara C. , Ando T.; 

RL Submitted (AUG- 1970) to the PIR data bank. 

CC -!- FUNCTION: PROTAMINES SUBSTITUTE FOR HISTONES IN THE CHROMATIN OF 
CC SPERM DURING THE HAPLOID PHASE OF SPERMATOGENESIS. THEY COMPACT 

CC SPERM DNA INTO A HIGHLY CONDENSED, STABLE AND INACTIVE COMPLEX. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 

CC -!- TISSUE SPECIFICITY: Testis. 

OR PIR; A37577; CLHRIA. 

DR PIR; A38051; CLHRYl . 

KW Chromosomal protein; Nucleosome core; Spermatogenesis; DNA-binding; 

KW Testis; DNA condensation; Nuclear protein. 

SQ SEQUENCE 31 AA; 4112 MW; BDCC9A140FF7F819 CRC64; 

Query Match 10.7%; Score 3; DB 1; Length 31; 

Best Local Similarity 100.0%; Pred. No. 5.6e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 



13 RRR 15 
III 



Db 



2 RRR 4 



RESULT 3 9 
SCK5 ANDMA 



ID SCK5_ANDMA STANDARD; PRT; 31 AA. 

AC P31719; 

DT Ol-JUL-1993 (Rel . 26, Created) 

DT Ol-JUL-1993 (Rel . 26, Last sequence update) 

DT 28-FEB-2003 (Rel . 41, Last annotation update) 

DE Leiurotoxin I -like toxin P05. 

OS Androctonus mauretanicus mauretanicus (Scorpion) . 

OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Scorpiones; 

OC Buthoidea; Buthidae; Androctonus. 

OX NCBI_TaxID=6860; 

RN [1] 

RP SEQUENCE . 

RC TI SSUE= Venom ; 

RX MEDLINE=93215831; PubMed=8385026 ; 

RA Zerrouk H. , Mansuelle P., Benslimane A., Rochat H. , 

RA Martin-Eauclaire M.-F.; 

RT "Characterization of a new leiurotoxin I -like scorpion toxin. P05 

RT from Androctonus mauretanicus mauretanicus ." ; 

RL FEES Lett. 320:189-192(1993). 

RN [2] 

RP SYNTHESIS. 

RX MEDLINE=93208090; PubMed=8457543 ; 

RA Sabatier J.-M., Zerrouk H., Darbon H. , Mabrouk K. , Benslimane A., 

RA Rochat H., Martin-Eauclaire M.-F., van Rietschoten J. ; 

RT "P05, a new leiurotoxin I-like scorpion toxin: synthesis and 

RT structure-activity relationships of the alpha-amidated analog, a 

RT ligand of Ca (2+) -act ivated K+ channels with increased affinity."; 

RL Biochemistry 32:2763-2770(1993). 

RN [3] 

RP STRUCTURE BY NMR. 

RX MEDLINE=94032328; PubMed=8218272 ; 

RA Meunier S., Bernassau J.-M., Sabatier J. -M. , Martin-Eauclaire M.-F., 

RA van Rietschoten J., Cambillau C. , Darbon H. ; 

RT "Solution structure of P05-NH2, a scorpion toxin analog with high 

RT affinity for the apamin-sensitive potassium channel."; 

RL Biochemistry 32:11969-11976(1993). 

CC -I- FUNCTION: Blocker for the small conductance calcium-activated 
CC potassium channels (SK-Ca) (also known as apamine- sensitive 

CC potassium channel) . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Expressed by the venom gland. 

CC -!- SIMILARITY: BELONGS TO THE SHORT SCORPION TOXIN FAMILY. POTASSIUM 
CC CHANNEL INHIBITORS SUBFAMILY. 

DR FIR; A49078; A49078. 

DR PDB; IPNH; 31- JAN- 94. 

DR InterPro; I PRO 01 94 7; Scorpion_tOxinS . 

DR Pfam; PF00451; tOxin_2 ; 1. 

DR PROSITE; PS01138; SC0RP__SH0RT_TOXIN; 1. 

KW Toxin; Neurotoxin; Ionic channel inhibitor; 

KW Potassium channel inhibitor; 3D-structure . 

FT DISULFID 3 21 

FT DISULFID 8 26 



FT 


DISULFID 


12 


28 


FT 


HELIX 


5 


13 


FT 


TURN 


14 


15 


FT 


STRAND 


18 


22 


FT 


TURN 


23 


24 


FT 


STRAND 


25 


29 


SQ 


SEQUENCE 


31 AA; 


3421 



94B06CDC5CFB46FE CRC64 ; 



Query Match 10.7%; 
Best Local Similarity 100.0% 
Matches 3; Conservative 



Score 3; DB 1; Length 31; 
Pred. No. 5.6e+03; 
0; Mismatches 0; Indels 



0; Gaps 



Qy 

Db 



12 LRR 14 

III 

5 LRR 7 



RESULT 4 0 
SCKL_LEIQH 

ID SCKL_LEIQH STANDARD; PRT; 31 AA. 

AC P16341; 

DT Ol-APR-1990 (Rel - 14, Created) 

DT Ol-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Leiurotoxin I (LeTx I) (Scyllatoxin) . 

OS Leiurus quinquestriatus hebraeus (Yellow scorpion) . 

OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Scorpiones; 

OC Buthoidea; Buthidae; Leiurus. 

OX NCBI_TaxID=68 84; 

RN [1] 

RP SEQUENCE . 

RC TISSUE=Venom; 

RX MEDLINE=88273110; PubMed=283 9478 ; 

RA Chicci G.G., Gimenez -Gal lego G., Ber E., Garcia M.L. , Winquist R. , 

RA Cascieri M.A. ; 

RT "Purification and characterization of a unique, potent inhibitor of 

RT apamin binding from Leiurus quinquestriatus hebraeus venom."; 

RL J. Biol. Chem. 263:10192-10197(1988). 

RN [2] 

RP SYNTHESIS. 

RX MEDLINE=90170992; PubMed=23 07683 ; 

RA August e P., Hugues M., Grave B., Gesquiere J.C. , Maes P., Tartar A., 

RA Romey G. , Schweitz H. , Lazdunski M. ; 

RT "Leiurotoxin I (scyllatoxin), a peptide ligand for Ca2 (+) -activated 

RT K+ channels. Chemical synthesis, radiolabeling, and receptor 

RT characterization."; 

RL J. Biol. Chem. 265:4753-4759(1990). 

RN [3] 

RP STRUCTURE BY NMR. 

RX MEDLINE=90127439; PubMed=2 153 586 ; 

RA Martins J.C, Zhang W., Tartar A. , Lazdunski M. , Borremans F.A.M. ; 

RT "Solution conformation of leiurotoxin I (scyllatoxin) by IH nuclear 

RT magnetic resonance. Resonance assignment and secondary structure."; 

RL FEBS Lett. 260:249-253(1990). 

CC -!- FUNCTION: Blocker for the small conductance calcium-activated 
CC potassium channels (SK-Ca) (also known as apamine- sensitive 

CC potassium channel) . 



cc -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Expressed by the venom gland. 

CC SIMILARITY: BELONGS TO THE SHORT SCORPION TOXIN FAMILY. POTASSIUM 

CC CHANNEL INHIBITORS SUBFAMILY. 

DR PIR; A28805; A28805. 

DR PDB; ISCY; 26-JAN-95. 

DR InterPro; IPR001947; Scorpion_toxinS . 

DR Pfam; PF00451; toxin_2 ; 1. 

DR PROSITE; PS01138; SCORP_SHORT_TOXIN; 1. 

KW Toxin; Neurotoxin; Ionic channel inhibitor; 

KW Potassium channel inhibitor; Amidation; 3D-structure . 



FT 


DISULFID 


3 


21 




FT 


DISULFID 


8 


26 




FT 


DISULFID 


12 


28 




FT 


MOD_RES 


31 


31 


AMIDATION. 


FT 


HELIX 


5 


14 




FT 


TURN 


15 


16 




FT 


STRAND 


18 


21 




FT 


STRAND 


26 


29 




SQ 


SEQUENCE 


31 AA; 


3430 MW; 


BB3183DC5CEA53A7 



Query Match 10.7%; Score 3; DB 1; Length 31; 

Best Local Similarity 100.0%; Pred. No. 5.6e+03; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 5 LGK 7 

Ml 

Db 18 LGK 20 



Search completed: January 14, 2004, 10:35:40 
Job time : 5.62305 sees 



